Preface


It is hard to fit a graduate course on electromagnetic theory into one
semester. On the other hand, it is hard to stretch it to two semesters. This
text is based on a two-semester MIT course designed to solve the problem
by a compromise: Allow approximately one and a half semesters for 
electromagnetic theory, including scattering theory, special relativity and 
Lagrangian field theory, and add approximately one-half semester on 
gravitation. 

It is assumed throughout that the reader has a physics background 
that includes an intermediate-level knowledge of electromagnetic pheno- 
mena and their theoretical description. This permits the text to be very 
theory-centered, starting in Chapters 1 and 2 with the simplest experi- 
mental facts (Coulomb’s law, the law of Biot and Savart, Faraday’s law) 
and proceeding to the corresponding differential equations; theoretical 
constructs, such as energy, momentum, and stress; and some applications, 
such as fields in matter, fields in the presence of conductors, and forces 
on matter. 

In Chapter 3, Maxwell’s equations are obtained by introducing the 
displacement current, thus making the modified form of Ampère’s law
consistent for fields in the presence of time-dependent charge and current 
densities. The remainder of Chapters 3-5 applies Maxwell’s equations to 
wave propagation, radiation, and scattering. 

In Chapter 6, special relativity is introduced. It is also assumed here 
that the reader comes with prior knowledge of the historic and experi- 
mental background of the subject. The major thrust of the chapter is to 
translate the physics of relativistic invariance into the language of four- 
dimensional tensors. This prepares the way for Chapter 7, in which we 
study Lagrangian methods of formulating Lorentz-covariant theories of 
interacting particles and fields. 

The treatment of gravitation is intended as an introduction to the 
subject. It is not a substitute for a full-length study of general relativity, 
such as might be based on Weinberg’s book.¹ Paralleling the treatment
of electromagnetism in earlier chapters, we start from Newton’s law of
gravitation. Together with the requirements of Lorentz covariance and
the very precise proportionality of inertial and gravitational mass, this law
requires that the gravitational potential consist of a second-rank (or
higher) tensor.

¹Steven Weinberg, Gravitation and Cosmology, New York: John Wiley & Sons, 1972.

In complete analogy with the earlier treatment of the vector (electromagnetic)
field, following Schwinger,² we develop a theory of the free
tensor field. Just as Maxwell’s equations required that the vector field be 
coupled to a conserved vector source (the electric current density), the 
tensor field equations require that their tensor source be conserved. The 
only available candidate for such a tensor source is the stress-energy 
tensor, which in the weak field approximation we take as the stress-energy 
tensor of all particles and fields other than the gravitational field. This 
leads to a linear theory of gravitation that incorporates all the standard 
tests of general relativity (red shift, light deflection, Lense–Thirring effect,
gravitational radiation) except for the precession of planetary orbits,
whose calculation requires nonlinear corrections to the gravitational
potential.

In order to remedy the weak field approximation, we note that the 
linear equations are not only approximate, but inconsistent. The reason 
is that the stress-energy tensor of the sources alone is not conserved, since 
the sources exchange energy and momentum with the gravitational field. 
The remedy is to recognize that the linear equations are, in fact, consistent 
in a coordinate system that eliminates the gravitational field, that is, one 
that brings the tensor $g_{\mu\nu}$ locally to Minkowskian form. The consistent
equations in an arbitrary coordinate system can then be written down 
immediately—they are Einstein’s equations. The basic requirement is that 
the gravitational potential transform like a tensor under general coordinate 
transformations. 

Our approach to gravitation is not historical. However, it parallels the 
way electromagnetism developed: experiment → equations without the
displacement current; consistency plus the displacement current →
Maxwell’s equations. It seems quite probable that without Einstein the
theory of gravitation would have developed in the same way, that is, in 
the way we have just described. Einstein remarkably preempted what 
might have been a half-century of development. Nevertheless, I believe 
it is useful, in an introduction for beginning students, to emphasize the 
field theoretic aspects of gravitation and the strong analogies between 
gravitation and the other fields that are studied in physics. 

The material in the book can be covered in a two-term course without 
crowding; achieving that goal has been a boundary condition from the 
start. Satisfying that condition required that choices be made. As a
consequence, there is no discussion of many interesting and useful subjects.
Among them are standard techniques in solving electrostatic and
magnetostatic problems; propagation in the presence of boundaries, for example,
cavities and wave guides; physics of plasmas and magnetohydrodynamics;
particle motion in given fields and accelerators. In making these choices,
we assumed that the graduate student reader would already have been
exposed to some of these subjects in an earlier course. In addition, the
subjects appear in the end-of-chapter problems sections.

²J. Schwinger, Particles, Sources and Fields, Addison-Wesley, 1970.

My esteemed colleague Kenneth Johnson once remarked to me that 
a textbook, as opposed to a treatise, should include everything a student 
must know, not everything the author does know. I have made an effort 
to hew to that principle; I believe I have deviated from it only in Chapter 
5, on scattering. I have included a discussion of scattering because it has 
long been a special interest of mine; also, the chapter contains some 
material that I believe is not easily available elsewhere. It may be omitted 
without causing problems in the succeeding chapters. 

The two appendices (the first on vectors and tensors, the second on 
spherical harmonics) are included because, although these subjects are 
probably well known to most readers, their use recurs constantly 
throughout the book. In addition to the material in the appendices, some 
knowledge of Fourier transforms and complex variable theory is assumed. 

The problems at the end of each chapter serve three purposes. First, 
they give a student an opportunity to test his or her understanding of the 
material in the text. Second, as I mentioned earlier, they can serve as an 
introduction to or review of material not included in the text. Third, they 
can be used to develop, with the students’ help, examples, extensions, 
and generalizations of the material in the text. Included among these are 
a few problems that are at the mini-research-problem level. In presenting 
these, I have generally tried to outline a path for achieving the final result. 
These problems are marked with an asterisk. I have not deliberately 
included problems that require excessive cleverness to solve. For a teacher 
searching for a wider set of problems, I recommend the excellent text of 
Jackson,³ which has an extensive set.

One last comment. I have not hesitated to introduce quantum
interpretations, where appropriate, and even the Schrödinger equation on
one occasion, in Chapter 3. I would expect a graduate student to have
run across it (the Schrödinger equation) somewhere in graduate school
by the time he or she reaches Chapter 3.

Finally, I must acknowledge many colleagues for their help. Special 
thanks go to Professors Stanley Deser, Jeffrey Goldstone, Roman Jackiw, 
and Kenneth Johnson. I am grateful to the late Roger Gilson and to 
Evan Reidell, Peter Unrau, and Rachel Cohen for their help with the
manuscript, and to Steven Weinberg and David Jackson for their excellent
texts, from which I have freely borrowed.

³J. D. Jackson, Classical Electrodynamics, New York: John Wiley and Sons, 1962.


Francis E. Low 
Cambridge, Massachusetts
CHAPTER 1


Electrostatics 


1.1. COULOMB’S LAW 


In the first half of the eighteenth century, the basic facts of electrostatics 
were sorted out: the existence of two signs of transferable electric charge; 
the additive conservation of that charge; the existence of insulators and 
conductors. The process is described in a lively way by Whittaker.¹ In the
next half-century, the quantitative law of repulsion of like charges was
determined by Priestley and extended to charges of both signs by
Coulomb. By 1812, with the publication of the famous memoir of Poisson,²
the science of electrostatics was understood almost in its present form: 
potentials, conductors, etc. Of course, the specific knowledge of the nature 
of the carriers of electric charge awaited the experimental discoveries of 
the late nineteenth and early twentieth century. 

The resultant formulation of electrostatics starts from Coulomb’s law 
for the force between two small particles, each carrying a positive or a 
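negative charge. We call the charges $q_1$ and $q_2$, and their vector positions
$\mathbf{r}_1$ and $\mathbf{r}_2$, respectively:

$$\mathbf{F}(2 \text{ on } 1) = q_1 q_2 \frac{\mathbf{r}_1 - \mathbf{r}_2}{|\mathbf{r}_1 - \mathbf{r}_2|^3} \tag{1.1.1}$$

and

$$\mathbf{F}(1 \text{ on } 2) = -\mathbf{F}(2 \text{ on } 1). \tag{1.1.2}$$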


¹A History of the Theories of Aether and Electricity, Tomash Publishers (American
Institute of Physics, New York), 1987 (1st edition, 1910).
²Mém. de l’Institut, 1811.

Like charges repel, unlike attract. Most important, the forces are linearly
additive. That is, there are no three-body electrostatic forces.³ Thus, with
three charges present, the total force on 1 is found to be


$$\mathbf{F}(\text{on } 1) = q_1 q_2 \frac{\mathbf{r}_1 - \mathbf{r}_2}{|\mathbf{r}_1 - \mathbf{r}_2|^3} + q_1 q_3 \frac{\mathbf{r}_1 - \mathbf{r}_3}{|\mathbf{r}_1 - \mathbf{r}_3|^3}. \tag{1.1.3}$$

If $\mathbf{r}_2$ and $\mathbf{r}_3$ are close together, the form of (1.1.3) goes over to (1.1.1)
with $q_{2+3} = q_2 + q_3$. Thus, charge is additive. It is also conserved. That
is, positive charge is never found to appear on some surface without
compensating positive charge disappearing or negative charge appearing
somewhere else.

Equation (1.1.1) serves to define the electrostatic unit of charge. This 
is a charge that repels an equal charge 1 cm away with a force of 1 dyne. 

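It is useful to define an electric field at a point $\mathbf{r}$ as the force that would
act on a small test charge $\delta q$ at $\mathbf{r}$ divided by $\delta q$, where the magnitude of
$\delta q$ is small enough so that its effect on the environment can be ignored.
Thus, the field, a property of the space point $\mathbf{r}$, is given by

$$\mathbf{E}(\mathbf{r}) = \frac{\mathbf{F}(\text{on } \delta q)}{\delta q} \tag{1.1.4}$$

and, by (1.1.3) generalized to many charges,

$$\mathbf{E}(\mathbf{r}) = \sum_i q_i \frac{\mathbf{r} - \mathbf{r}_i}{|\mathbf{r} - \mathbf{r}_i|^3}. \tag{1.1.5}$$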
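
As an illustration of the superposition sum (1.1.5), the following minimal sketch evaluates the field of a few point charges in Gaussian units. The function name `electric_field` and the sample charges are our own choices for illustration; they are not part of the text.

```python
def electric_field(point, charges):
    """Field E(r) of point charges in Gaussian units, following (1.1.5).

    charges is a list of (q_i, (x_i, y_i, z_i)) pairs.
    """
    ex = ey = ez = 0.0
    for q, (x, y, z) in charges:
        dx, dy, dz = point[0] - x, point[1] - y, point[2] - z
        dist3 = (dx * dx + dy * dy + dz * dz) ** 1.5   # |r - r_i|^3
        ex += q * dx / dist3
        ey += q * dy / dist3
        ez += q * dz / dist3
    return (ex, ey, ez)

# One charge q = 2 esu at the origin, observed 2 cm away on the z axis.
single = electric_field((0.0, 0.0, 2.0), [(2.0, (0.0, 0.0, 0.0))])

# Two nearby charges q2 = 1 and q3 = 3 seen from far away, compared with
# a single lumped charge q2 + q3 = 4 at the origin.
far_point = (100.0, 0.0, 0.0)
pair = electric_field(far_point, [(1.0, (0.0, 0.0, 0.01)), (3.0, (0.0, 0.0, -0.01))])
lumped = electric_field(far_point, [(4.0, (0.0, 0.0, 0.0))])
```

The comparison of `pair` with `lumped` mirrors the remark that (1.1.3) goes over to (1.1.1) with $q_{2+3} = q_2 + q_3$ when two charges are close together.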


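We can generalize (1.1.5) to an arbitrary charge distribution by defining
a charge density at a point $\mathbf{r}$ as

$$\rho(\mathbf{r}) = \frac{\delta q}{\delta \mathbf{r}}, \tag{1.1.6}$$

where $\delta q$ is the charge in the very small three-dimensional volume element
$\delta \mathbf{r}$. The sum in (1.1.5) turns into a volume integral:

$$\mathbf{E}(\mathbf{r}) = \int d\mathbf{r}' \, \rho(\mathbf{r}') \frac{\mathbf{r} - \mathbf{r}'}{|\mathbf{r} - \mathbf{r}'|^3}, \tag{1.1.7}$$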
where $d\mathbf{r}'$ represents the three-dimensional volume element. Note that in
spite of the singularity at $\mathbf{r}' = \mathbf{r}$, the integral (1.1.7) is finite for a finite
charge distribution, even when the point $\mathbf{r}$ is in the region containing
charge. This is because the volume element $d\mathbf{r}'$ in the neighborhood of a
point $\mathbf{r}$ goes like $|\mathbf{r}' - \mathbf{r}|^2$ for small $|\mathbf{r} - \mathbf{r}'|$, thereby canceling the
singularity.

³This statement does not hold at the microscopic or atomic level. For example, the
interactions between atoms (van der Waals forces) include three-body forces. These are,
however, derived from the underlying two-body Coulomb interaction.

We can return to the form (1.1.5) by imagining the charge distribution
as consisting of very small clumps of charge $q_i$ at positions $\mathbf{r}_i$; the quantity

$$q_i = \int_{i\text{th clump}} d\mathbf{r}' \, \rho(\mathbf{r}') \tag{1.1.8}$$

is the charge $q_i$ in the $i$th clump. Its volume must be small enough so that
$\mathbf{r}'$ in (1.1.7) does not vary significantly over the clump.

The mathematical point charge limit keeps the integral
$\int \rho \, d\mathbf{r}' = q$ constant as the size of the clump goes to zero. It is useful
to give a density that behaves this way a name. It is called the delta
function, with the properties

$$\delta(\mathbf{r} - \mathbf{r}') = 0, \qquad \mathbf{r} \neq \mathbf{r}' \tag{1.1.9}$$

and

$$\int d\mathbf{r}' \, \delta(\mathbf{r} - \mathbf{r}') = 1 \tag{1.1.10}$$

provided the $\mathbf{r}'$ integration includes the point $\mathbf{r}$. Of course, $\delta(\mathbf{r})$ is not a
real function; however, as we shall see repeatedly, its use leads to helpful
shortcuts, provided one takes care not to multiply $\delta(\mathbf{r})$ by functions that
are singular at $\mathbf{r} = 0$.

Evidently, the fields of surface and line charge distributions can be 
written in the form (1.1.7), with the charge density including surface and 
line charge (i.e., one- and two-dimensional) delta functions. When the 
dimensionality of the delta function is in doubt, we add a superscript, thus 
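$\delta^3(\mathbf{r}_3)$ for a point charge, $\delta^2(\mathbf{r}_2)$ for a line charge, and $\delta^1(\mathbf{r}_1)$ for a surface
charge; here, $\mathbf{r}_3$, $\mathbf{r}_2$, and $\mathbf{r}_1$ represent three-, two-, and one-dimensional
vectors, respectively. Note that $\delta^3$, $\delta^2$, and $\delta^1$ can be expressed as products
of one-dimensional delta functions. Thus, for example,
$\delta^3(\mathbf{r}_3) = \delta^1(x)\delta^1(y)\delta^1(z)$, $\delta^2(\mathbf{r}_2) = \delta^1(x)\delta^1(y)$, and $\delta^1(\mathbf{r}_1) = \delta^1(x)$.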

Given the charge distribution p(r), (1.1.7) tells us how to calculate 
the electric field at any point by a volume integral—if necessary, numeri- 
cally. We might therefore be tempted to terminate our study of electrostat- 
ics here and go on to magnetism. There are, however, a large number of 
electrostatic situations where we do not know p(r), but are nevertheless 
able to understand and predict the field configuration. In order to do that,
however, it is necessary to study the differential equations satisfied by the
electric field.

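We start by observing from (1.1.7) that the electric field can be derived
from a potential $\phi(\mathbf{r})$. That is,

$$\mathbf{E}(\mathbf{r}) = -\nabla \phi(\mathbf{r}), \tag{1.1.11}$$

where

$$\phi(\mathbf{r}) = \int d\mathbf{r}' \, \frac{\rho(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}. \tag{1.1.12}$$

Equation (1.1.7) follows from (1.1.11) and (1.1.12) since

$$\nabla \frac{1}{r} = \left( \hat{e}_x \frac{\partial}{\partial x} + \hat{e}_y \frac{\partial}{\partial y} + \hat{e}_z \frac{\partial}{\partial z} \right) \frac{1}{r} \tag{1.1.13}$$

(where $\hat{e}_x$, $\hat{e}_y$, and $\hat{e}_z$ are unit vectors in the three coordinate directions) and

$$\frac{\partial}{\partial x} \frac{1}{r} = \frac{\partial}{\partial x} \frac{1}{\sqrt{x^2 + y^2 + z^2}} = -\frac{x}{(x^2 + y^2 + z^2)^{3/2}}, \tag{1.1.14}$$

so that, with similar equations for $y$ and $z$,

$$\nabla \frac{1}{r} = -\frac{\hat{e}_x x + \hat{e}_y y + \hat{e}_z z}{r^3} = -\frac{\mathbf{r}}{r^3} \tag{1.1.15}$$

and

$$\nabla \frac{1}{|\mathbf{r} - \mathbf{r}'|} = -\frac{\mathbf{r} - \mathbf{r}'}{|\mathbf{r} - \mathbf{r}'|^3}. \tag{1.1.16}$$

From (1.1.11) we learn that

$$\nabla \times \mathbf{E} = 0 \tag{1.1.17}$$

since

$$\nabla \times \nabla \phi = 0 \tag{1.1.18}$$

identically for any $\phi$. Of course, we could have derived (1.1.17) directly
by taking the curl of (1.1.7).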

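On the other hand, given (1.1.17), we can derive the existence of a
potential. We define

$$\phi(\mathbf{r}) = -\int_{\mathbf{r}_0}^{\mathbf{r}} \mathbf{E}(\mathbf{r}') \cdot d\mathbf{l}', \tag{1.1.19}$$

where $\int d\mathbf{l}'$ represents a line integral along an arbitrary path from the
point $\mathbf{r}_0$ (where $\phi$ is defined to be zero) to the point $\mathbf{r}$. We show that $\phi$
in (1.1.19) is independent of the path by calculating the difference of $\phi$
defined by two paths, $P_1$ and $P_2$:

$$\phi_1(\mathbf{r}) - \phi_2(\mathbf{r}) = -\int_{P_1} \mathbf{E} \cdot d\mathbf{l} + \int_{P_2} \mathbf{E} \cdot d\mathbf{l} = -\oint_C \mathbf{E} \cdot d\mathbf{l}, \tag{1.1.20}$$

where $\oint_C \mathbf{E} \cdot d\mathbf{l}$ represents the line integral around a closed path $C$, given
by going from $\mathbf{r}_0$ to $\mathbf{r}$ along $P_1$ and back from $\mathbf{r}$ to $\mathbf{r}_0$ along $P_2$.
By Stokes' theorem,

$$\oint_C \mathbf{E} \cdot d\mathbf{l} = \int_S \nabla \times \mathbf{E} \cdot d\mathbf{S}, \tag{1.1.21}$$

where $S$ is any oriented surface bounded by $C$. Thus, since $\nabla \times \mathbf{E} = 0$,
$\phi_1 = \phi_2$ and the integral defining $\phi$ is independent of the path from
$\mathbf{r}_0$ to $\mathbf{r}$.

We note that changing $\mathbf{r}_0$ corresponds to adding a constant to $\phi$:

$$\phi_{\mathbf{r}_0}(\mathbf{r}) = -\int_{\mathbf{r}_0}^{\mathbf{r}} \mathbf{E} \cdot d\mathbf{l} \tag{1.1.22}$$

and

$$\phi_{\mathbf{r}_0'}(\mathbf{r}) = -\int_{\mathbf{r}_0'}^{\mathbf{r}} \mathbf{E} \cdot d\mathbf{l}$$

so that

$$\phi_{\mathbf{r}_0'} = \phi_{\mathbf{r}_0} - \int_{\mathbf{r}_0'}^{\mathbf{r}_0} \mathbf{E} \cdot d\mathbf{l} \tag{1.1.23}$$

with $-\int_{\mathbf{r}_0'}^{\mathbf{r}_0} \mathbf{E} \cdot d\mathbf{l}$ the additive constant (it is independent of $\mathbf{r}$).

Finally, it is clear that $-\nabla \phi$ defined by (1.1.19) is the electric field.
We show this for the $x$ component: Let

$$d\mathbf{r} = \hat{e}_x \, dx$$

and choose the path to $\mathbf{r} + d\mathbf{r}$ as $\mathbf{r}_0$ to $\mathbf{r}$ followed by $d\mathbf{r}$. Then

$$\frac{\partial \phi}{\partial x} = \frac{\phi(\mathbf{r} + \hat{e}_x \, dx) - \phi(\mathbf{r})}{dx} = -\frac{\int_{\mathbf{r}_0}^{\mathbf{r}} \mathbf{E} \cdot d\mathbf{l} + E_x \, dx - \int_{\mathbf{r}_0}^{\mathbf{r}} \mathbf{E} \cdot d\mathbf{l}}{dx} = -E_x \tag{1.1.24}$$

in the limit $dx \to 0$.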
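
The path independence of the line integral in (1.1.19) can be checked numerically for the simplest case, a unit point charge at the origin. The sketch below is only an illustration under that assumption; the helper names `field` and `minus_line_integral`, the midpoint rule, and the particular paths are our own choices.

```python
def field(p, q=1.0):
    """Field of a point charge q at the origin, E = q r / r^3 (Gaussian units)."""
    r3 = (p[0] ** 2 + p[1] ** 2 + p[2] ** 2) ** 1.5
    return (q * p[0] / r3, q * p[1] / r3, q * p[2] / r3)

def minus_line_integral(waypoints, steps=20000):
    """Minus the line integral of E along straight segments joining the waypoints."""
    total = 0.0
    for a, b in zip(waypoints, waypoints[1:]):
        dl = [(b[k] - a[k]) / steps for k in range(3)]
        for n in range(steps):
            mid = [a[k] + (n + 0.5) * dl[k] for k in range(3)]   # midpoint rule
            e = field(mid)
            total -= e[0] * dl[0] + e[1] * dl[1] + e[2] * dl[2]
    return total

r0 = (4.0, 0.0, 0.0)          # reference point, where phi is set to zero
r = (0.0, 2.0, 0.0)           # field point
phi_direct = minus_line_integral([r0, r])                      # path P1
phi_detour = minus_line_integral([r0, (4.0, 2.0, 3.0), r])     # path P2
phi_exact = 1.0 / 2.0 - 1.0 / 4.0   # q/|r| - q/|r0| for q = 1
```

Both paths should reproduce `phi_exact` to within the accuracy of the quadrature, as (1.1.20) and (1.1.21) require.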

In general, a vector function of position (which goes to zero sufficiently
rapidly as $r \to \infty$) is completely determined by its curl and its divergence.
In our case, a charge density confined to a finite region of space will,
according to (1.1.7), give rise to an electric field that goes to zero like
$1/r^2$; this is fast enough for the theorem to hold. (See Problem A.21.) We
therefore turn to the calculation of $\nabla \cdot \mathbf{E}$.

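For this purpose, we consider the field of a single point charge at the
origin,

$$\mathbf{E} = q \frac{\mathbf{r}}{r^3}. \tag{1.1.25}$$

$\nabla \cdot \mathbf{E}$ would appear to be given by

$$\frac{1}{q} \nabla \cdot \mathbf{E} = \frac{\partial}{\partial x}\left(\frac{x}{r^3}\right) + \frac{\partial}{\partial y}\left(\frac{y}{r^3}\right) + \frac{\partial}{\partial z}\left(\frac{z}{r^3}\right)$$

$$= \left(\frac{1}{r^3} - \frac{3x^2}{r^5}\right) + \left(\frac{1}{r^3} - \frac{3y^2}{r^5}\right) + \left(\frac{1}{r^3} - \frac{3z^2}{r^5}\right) = 0. \tag{1.1.26}$$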


Equation (1.1.26) clearly holds for $r \neq 0$. The singular point $r = 0$
presents a problem: Consider the electric flux through a closed surface $S$
enclosing the charge at the origin, that is, the surface integral of the
electric field over a surface $S$,

$$I = \int_S d\mathbf{S} \cdot \mathbf{E}, \tag{1.1.27}$$

with the vector $d\mathbf{S}$ defined as the outward normal from the closed surface.

The integral (1.1.27) is independent of the surface, provided the
displacement from one surface to the other does not cross the origin. Thus,

$$I_1 - I_2 = \int_{S_1} d\mathbf{S}_1 \cdot \mathbf{E} - \int_{S_2} d\mathbf{S}_2 \cdot \mathbf{E},$$

where $d\mathbf{S}_1$ and $d\mathbf{S}_2$ are outward normals viewed from the origin. The two
surface vectors $d\mathbf{S}_1$ and $-d\mathbf{S}_2$ are the outward normals of the surface
bounding the volume contained between $S_1$ and $S_2$, provided $S_1$ is outside
$S_2$. Thus,

$$I_1 - I_2 = \int d\mathbf{S} \cdot \mathbf{E} \tag{1.1.28}$$

and by Gauss' theorem

$$I_1 - I_2 = \int d\mathbf{r} \, \nabla \cdot \mathbf{E} = 0 \tag{1.1.29}$$

since the space between the surfaces does not include the singular point
at the origin.

Consider first the integral (1.1.27) with the origin inside the surface.
We choose the surface to be a sphere about the origin and find

$$I = \int_S \mathbf{E} \cdot d\mathbf{S} = q \int \frac{1}{r^2} \, r^2 \, d\Omega, \tag{1.1.30}$$

where $d\Omega$ is the solid angle subtended by $d\mathbf{S}$. Thus,

$$I = 4\pi q. \tag{1.1.31}$$

If $S$ encloses several charges, we can calculate the contribution of each
charge to $I$ separately (since the fields are additive), yielding Gauss' law:

$$\int_S \mathbf{E} \cdot d\mathbf{S} = 4\pi \sum_i q_i, \tag{1.1.32}$$

where the sum is over all the charges inside the surface $S$.
If the surface has no charges inside it, the integral $\int \mathbf{E} \cdot d\mathbf{S}$ is zero by
Gauss' theorem:

$$\int_S \mathbf{E} \cdot d\mathbf{S} = \int d\mathbf{r} \, \nabla \cdot \mathbf{E} = 0 \tag{1.1.33}$$

since $\nabla \cdot \mathbf{E} = 0$ away from charges. Clearly, however, $\nabla \cdot \mathbf{E}$ cannot equal
zero everywhere, since, if it did, the integral (1.1.32) would be zero
instead of $4\pi \sum_i q_i$.
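
Gauss' law (1.1.32) and the vanishing flux (1.1.33) can be illustrated by integrating $\mathbf{E} \cdot d\mathbf{S}$ numerically over a unit sphere. The sketch below assumes point charges that are not on the sphere itself; the function name `flux_through_sphere`, the grid sizes, and the sample charges are hypothetical choices made for this example.

```python
import math

def flux_through_sphere(charges, radius=1.0, n_u=400, n_phi=200):
    """Approximate the integral of E.dS over a sphere centered at the origin.

    charges is a list of (q, (x, y, z)). The midpoint rule is used in
    u = cos(theta) and in the azimuth phi, with dS = radius^2 du dphi.
    """
    du, dphi = 2.0 / n_u, 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_u):
        u = -1.0 + (i + 0.5) * du
        s = math.sqrt(1.0 - u * u)
        for j in range(n_phi):
            phi = (j + 0.5) * dphi
            n = (s * math.cos(phi), s * math.sin(phi), u)   # outward unit normal
            e_dot_n = 0.0
            for q, c in charges:
                d = [radius * n[k] - c[k] for k in range(3)]
                dist3 = (d[0] ** 2 + d[1] ** 2 + d[2] ** 2) ** 1.5
                e_dot_n += q * (d[0] * n[0] + d[1] * n[1] + d[2] * n[2]) / dist3
            total += e_dot_n * radius ** 2 * du * dphi
    return total

# Two off-center charges inside the unit sphere, total charge 1.5 - 0.5 = 1.
inside = flux_through_sphere([(1.5, (0.3, 0.0, 0.2)), (-0.5, (0.0, -0.4, 0.0))])

# A single charge outside the sphere.
outside = flux_through_sphere([(2.0, (0.0, 0.0, 2.5))])
```

Within the quadrature error, `inside` should come out close to $4\pi$ times the enclosed charge, independent of where the charges sit inside the sphere, and `outside` should be close to zero.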

We can find the equation for $\nabla \cdot \mathbf{E}$ by considering finite charge density
$\rho(\mathbf{r})$. Then (1.1.32) tells us that for any closed surface, the flux through
the surface is equal to $4\pi$ times the total charge inside the surface:

$$\int_S \mathbf{E} \cdot d\mathbf{S} = 4\pi \int_V d\mathbf{r} \, \rho, \tag{1.1.34}$$

where the integral $d\mathbf{r}$ is over the enclosed volume. Gauss' theorem applied
to (1.1.34) gives

$$\int_V d\mathbf{r} \, (\nabla \cdot \mathbf{E} - 4\pi\rho) = 0 \tag{1.1.35}$$

for any volume $V$. Thus, the integrand must be zero and we have the
equation for the divergence of $\mathbf{E}$:

$$\nabla \cdot \mathbf{E} = 4\pi\rho. \tag{1.1.36}$$


The special case of a point charge at the origin, for which $\rho = q\,\delta(\mathbf{r})$ and
$\mathbf{E} = q(\mathbf{r}/r^3)$, shows that $\nabla \cdot (\mathbf{r}/r^3)$ acts as if

$$\nabla \cdot \frac{\mathbf{r}}{r^3} = 4\pi \, \delta(\mathbf{r}). \tag{1.1.37}$$

Equation (1.1.36) yields an equation for the electrostatic potential $\phi$:

$$\nabla \cdot \mathbf{E} = -\nabla \cdot \nabla \phi = 4\pi\rho \quad \text{or} \quad \nabla^2 \phi = -4\pi\rho. \tag{1.1.38}$$

This is known as Poisson’s equation. In a portion of space where $\rho = 0$,
(1.1.38) becomes

$$\nabla^2 \phi = 0, \tag{1.1.39}$$

which is called Laplace’s equation. A function satisfying Laplace’s
equation is called harmonic.
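
A quick numerical check that the Coulomb potential $1/r$ is harmonic away from the origin, in the sense of (1.1.39), can be made with a finite-difference Laplacian. This is only a sketch: the helper `laplacian`, the step size, and the sample point are assumptions made for illustration.

```python
def laplacian(f, p, h=1e-3):
    """Second-order central-difference estimate of the Laplacian of f at p."""
    total = 0.0
    for k in range(3):
        up = list(p)
        dn = list(p)
        up[k] += h
        dn[k] -= h
        total += (f(up) + f(dn) - 2.0 * f(p)) / (h * h)
    return total

def coulomb_potential(p):
    """phi = q / r for a unit charge at the origin (Gaussian units)."""
    return 1.0 / (p[0] ** 2 + p[1] ** 2 + p[2] ** 2) ** 0.5

sample = (0.7, -0.4, 1.1)
lap_coulomb = laplacian(coulomb_potential, sample)

# For contrast, r^2 is not harmonic: its Laplacian is 6 everywhere.
lap_r_squared = laplacian(lambda p: p[0] ** 2 + p[1] ** 2 + p[2] ** 2, sample)
```

The first result should vanish up to discretization error, while the second stays at 6, which shows that the vanishing Laplacian is a property of $1/r$ and not of the difference scheme.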

As we remarked earlier, given the charge density $\rho$, the potential $\phi$
is determined (up to a constant) by the integral (1.1.12). We have given
the subsequent development in (1.1.13)–(1.1.39) for three reasons.

First, the integral form (1.1.32) can be a useful calculational tool in 
situations where there is sufficient symmetry to make the flux integration 
trivial. These applications are illustrated in the problems at the end of 
this chapter. 

Second, the differential equation (1.1.38) can be used when the actual 
charge distribution is not known and must be determined from boundary 
conditions, as in the case of charged conductors and dielectrics. 

Third, the Coulomb law does not correctly describe the electric field
in nonstatic situations, where we shall see that $\nabla \times \mathbf{E}$ is no longer zero.
However, the divergence equation does continue to hold.


1.2. MULTIPOLES AND MULTIPOLE FIELDS 


The electrostatic multipole expansion, which we take up in this section, 
provides an extremely useful and general way of characterizing a charge 
distribution and the potential to which it gives rise. Analogous expansions 
exist for magnetostatic and radiating systems [discussed in Chapter 2 
(Section 2.3) and Chapter 5 (Section 5.10), respectively]. 

As shown in Appendix B, the electrostatic potential outside of an 
arbitrary finite charge distribution can be expressed as a power series in 
the inverse radius 1/r: 


The $l$th term in the series is called a multipole field (or potential) of order
$l$; it can, in turn, be generated by a single multipole of order $l$, which we
now define, following Maxwell.

A monopole is a point charge $Q_0$; it gives rise to a potential [choosing
$\phi(\infty) = 0$]

$$\phi_0 = \frac{Q_0}{|\mathbf{r} - \mathbf{r}_0|}, \tag{1.2.1}$$

where $\mathbf{r}_0$ is the location of the charge.
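A point dipole consists of a charge $q$ at position $\mathbf{r}_0 + \mathbf{l}$ and a charge
$-q$ at $\mathbf{r}_0$, where we take the limit $\mathbf{l} \to 0$, with $\mathbf{l}q = \mathbf{p}$ held fixed. $\mathbf{p}$ is called
the electric dipole moment of the pair of charges. The potential of a point
dipole is given by

$$\phi_1 = \lim_{\mathbf{l} \to 0} \left[ \frac{q}{|\mathbf{r} - \mathbf{r}_0 - \mathbf{l}|} - \frac{q}{|\mathbf{r} - \mathbf{r}_0|} \right] = -q\,\mathbf{l} \cdot \nabla \frac{1}{|\mathbf{r} - \mathbf{r}_0|} = -\mathbf{p} \cdot \nabla \frac{1}{|\mathbf{r} - \mathbf{r}_0|}. \tag{1.2.2}$$

We separate $\mathbf{p}$ into a unit vector $\hat{l}_1$ and a magnitude $Q_1$, with $\mathbf{p} = Q_1 \hat{l}_1$.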
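
The limit in (1.2.2) can be followed numerically by shrinking the separation of the two charges while holding $q\mathbf{l}$ fixed. The sketch below uses hypothetical helper names (`pair_potential`, `dipole_potential`) and an arbitrary field point; using (1.1.16), the dipole potential is written as $\mathbf{p} \cdot (\mathbf{r} - \mathbf{r}_0)/|\mathbf{r} - \mathbf{r}_0|^3$.

```python
def pair_potential(r, q, l, r0=(0.0, 0.0, 0.0)):
    """Potential of +q at r0 + l and -q at r0 (Gaussian units)."""
    d_plus = sum((r[k] - r0[k] - l[k]) ** 2 for k in range(3)) ** 0.5
    d_minus = sum((r[k] - r0[k]) ** 2 for k in range(3)) ** 0.5
    return q / d_plus - q / d_minus

def dipole_potential(r, p, r0=(0.0, 0.0, 0.0)):
    """phi_1 = -p.grad(1/|r - r0|) = p.(r - r0)/|r - r0|^3, as in (1.2.2)."""
    d = [r[k] - r0[k] for k in range(3)]
    dist3 = sum(c * c for c in d) ** 1.5
    return sum(p[k] * d[k] for k in range(3)) / dist3

p = (0.0, 0.0, 2.0)                 # fixed dipole moment p = q l
r = (1.0, 2.0, 2.0)                 # field point, |r| = 3
errors = []
for separation in (1e-1, 1e-2, 1e-3):
    q = 2.0 / separation            # q grows as l shrinks so that q*l stays fixed
    l = (0.0, 0.0, separation)
    errors.append(abs(pair_potential(r, q, l) - dipole_potential(r, p)))
```

The entries of `errors` should shrink roughly in proportion to the separation, reflecting the quadrupole-like correction that the point dipole limit discards.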

We define higher moments by iterating the procedure: A quadrupole
is defined by displacing equal and opposite dipoles, etc. Thus, the $2^l$th
pole gives rise to a potential

$$\phi_l = Q_l \, (-1)^l \, \hat{l}_1 \cdot \nabla \; \hat{l}_2 \cdot \nabla \cdots \hat{l}_l \cdot \nabla \, \frac{1}{|\mathbf{r} - \mathbf{r}_0|}. \tag{1.2.3}$$

The potential $\phi_l$ is specified by $2l + 1$ numbers: the polar angles $\theta_i$ and
azimuths $\varphi_i$ of the $l$ unit vectors, and the magnitude $Q_l$.

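On the other hand, an arbitrary charge distribution $\rho(\mathbf{r})$ generates an
electrostatic potential

$$\phi(\mathbf{r}') = \int d\mathbf{r} \, \frac{\rho(\mathbf{r})}{|\mathbf{r} - \mathbf{r}'|}, \tag{1.2.4}$$

which can, for $\mathbf{r}'$ outside the charge distribution, be expanded in two
equivalent ways. The first is

$$\phi(\mathbf{r}') = \sum_{l=0}^{\infty} \frac{(-1)^l}{l!} \int d\mathbf{r} \, \rho(\mathbf{r}) \, P^{(l)}_{i_1 \ldots i_l}(\mathbf{r}) \, \frac{\partial}{\partial x'_{i_1}} \cdots \frac{\partial}{\partial x'_{i_l}} \frac{1}{r'}, \tag{1.2.5}$$

where the harmonic polynomials $P^{(l)}_{i_1 \ldots i_l}$ are defined in Appendix B:

$$P^{(l)}_{i_1 \ldots i_l}(\mathbf{r}) = x_{i_1} x_{i_2} \cdots x_{i_l} - (\text{traces times Kronecker deltas}), \tag{1.2.6}$$

where the traces are subtracted to make the tensor $P^{(l)}_{i_1 \ldots i_l}$ traceless. The
expansion (1.2.5) is then

$$\phi(\mathbf{r}') = \sum_{l=0}^{\infty} \frac{1}{l!} \, Q^{(l)}_{i_1 \ldots i_l} \, \phi^{(l)}_{i_1 \ldots i_l}(\mathbf{r}'), \tag{1.2.7}$$

where the potential $\phi^{(l)}_{i_1 \ldots i_l}$, defined in (B.2.3), is

$$\phi^{(l)}_{i_1 \ldots i_l}(\mathbf{r}') = (-1)^l \frac{\partial}{\partial x'_{i_1}} \cdots \frac{\partial}{\partial x'_{i_l}} \frac{1}{r'} \tag{1.2.8}$$

and the Cartesian $l$th rank tensor $Q^{(l)}_{i_1 \ldots i_l}$ is

$$Q^{(l)}_{i_1 \ldots i_l} = \int d\mathbf{r} \, \rho(\mathbf{r}) \, P^{(l)}_{i_1 \ldots i_l}(\mathbf{r}). \tag{1.2.9}$$

We call $Q^{(l)}_{i_1 \ldots i_l}$ the $2^l$th pole moment of the charge distribution. Since
$Q^{(l)}_{i_1 \ldots i_l}$ is an $l$th rank, traceless, symmetric tensor in three dimensions, the
number of independent $Q^{(l)}_{i_1 \ldots i_l}$ is $2l + 1$, as shown in (B.2).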


The second equivalent expression for (1.2.4) is

$$\phi(\mathbf{r}') = \sum_{l=0}^{\infty} \int d\mathbf{r} \, \rho(\mathbf{r}) \, \frac{r^l}{r'^{\,l+1}} \, P_l(\hat{\mathbf{r}} \cdot \hat{\mathbf{r}}') \tag{1.2.10}$$

$$= \sum_{l=0}^{\infty} \sum_{m=-l}^{l} \frac{4\pi}{2l+1} \, Q_{l,m} \, \frac{Y_{l,m}(\theta', \varphi')}{r'^{\,l+1}}, \tag{1.2.11}$$

where the $2^l$th pole moments $Q_{l,m}$ are given by

$$Q_{l,m} = \int d\mathbf{r} \, \rho(\mathbf{r}) \, Y^{*}_{l,m}(\theta, \varphi) \, r^l. \tag{1.2.12}$$

Note that here also the number of independent $Q_{l,m}$'s for each $l$ is $2l + 1$.
An obvious question to ask is whether the general potential given by
(1.2.11) can be reproduced by a series of Maxwell multipoles, one for
each $l$. The answer is yes; the proof was given by Sylvester and can be
found in that source of all wisdom, the 11th edition of the Encyclopedia
Britannica; look for it under harmonic functions. We do not give the proof
here. It is not trivial. Try it for $l = 2$. (See Problem 1.18.)

The number $2l + 1$ for the number of independent $Q_{l,m}$'s is slightly
deceptive, since the $Q_{l,m}$'s depend on the coordinate system in addition
to the intrinsic structure of the charge distribution. Since a coordinate
system is specified by three parameters (for example, the three Euler
angles with respect to a standard coordinate system), the number of
intrinsic components is, in general, $2l + 1 - 3 = 2l - 2$. This fails to hold for
$l = 1$ or $0$. Since rotations about a vector leave the vector invariant, the
number for $l = 1$ is $2l + 1 - 2 = 2l - 1 = 1$, as it must be: the magnitude
of the vector. For $l = 0$, the number is 1, since the charge is invariant to
all rotations. The full effect of the freedom of rotations shows up for the
first time for $l = 2$. Here, it is convenient to define a coordinate system
that diagonalizes the Cartesian tensor $Q^{(2)}_{ij}$. In this coordinate system, the
tensor $Q^{(2)}_{ij}$ vanishes for $i \neq j$; it has, in general, three nonvanishing
components $Q^{(2)}_{11}$, $Q^{(2)}_{22}$, and $Q^{(2)}_{33}$, with zero trace, that is,

$$Q^{(2)}_{11} + Q^{(2)}_{22} + Q^{(2)}_{33} = 0. \tag{1.2.13}$$

Any two of the three $Q^{(2)}_{ii}$'s (no sum over $i$) characterize the intrinsic
quadrupole structure of the charge distribution.
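
To make the trace condition (1.2.13) concrete, the sketch below evaluates the $l = 2$ tensor for a set of point charges. It assumes that the traceless harmonic polynomial of (1.2.6) for $l = 2$ is $x_i x_j - \delta_{ij} r^2/3$, which is the form that removes the trace; Appendix B is not reproduced here, so this form, the function name `quadrupole_tensor`, and the sample charges are assumptions made for illustration.

```python
def quadrupole_tensor(charges):
    """Q_ij = sum over charges of q (x_i x_j - delta_ij r^2 / 3).

    Assumed l = 2 form of the traceless harmonic polynomial in (1.2.6).
    """
    tensor = [[0.0] * 3 for _ in range(3)]
    for q, r in charges:
        r2 = r[0] ** 2 + r[1] ** 2 + r[2] ** 2
        for i in range(3):
            for j in range(3):
                delta = 1.0 if i == j else 0.0
                tensor[i][j] += q * (r[i] * r[j] - delta * r2 / 3.0)
    return tensor

# A linear quadrupole: charges +1, -2, +1 spaced 1 cm apart along the z axis.
linear = [(1.0, (0.0, 0.0, 1.0)), (-2.0, (0.0, 0.0, 0.0)), (1.0, (0.0, 0.0, -1.0))]
Q = quadrupole_tensor(linear)
trace = Q[0][0] + Q[1][1] + Q[2][2]
```

For this arrangement the tensor is already diagonal in the chosen axes, with $Q_{33} = 4/3$ and $Q_{11} = Q_{22} = -2/3$, so two of the three diagonal entries fix the third through the zero trace.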


1.3. ENERGY AND STRESS IN THE 
ELECTROSTATIC FIELD 


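The work done in bringing a small charge $\delta q_i$ from far away to a point $\mathbf{r}_i$
is

$$\delta W_i = -\int_{\infty}^{\mathbf{r}_i} d\mathbf{r} \cdot \mathbf{E} \, \delta q_i = \phi(\mathbf{r}_i) \, \delta q_i, \tag{1.3.1}$$

where we conventionally take $\phi(\infty)$ to be zero for a system whose charges
are all contained in a finite volume.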

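If we bring up several charges $\delta q_i$, each to a position $\mathbf{r}_i$, we have, to
lowest order in $\delta q_i$,

$$\delta W = \sum_i \delta W_i = \sum_i \phi(\mathbf{r}_i) \, \delta q_i$$

and for a continuous distribution (with $E$ for electric)

$$\delta W_E = \int_{\text{all space}} d\mathbf{r} \, \phi(\mathbf{r}) \, \delta\rho(\mathbf{r}). \tag{1.3.2}$$

This is the work done, to first order in $\delta\rho$, in changing $\rho(\mathbf{r})$ to $\rho(\mathbf{r}) + \delta\rho(\mathbf{r})$
and $\mathbf{E}$ to $\mathbf{E} + \delta\mathbf{E}$, where $\nabla \cdot \delta\mathbf{E} = 4\pi\,\delta\rho$. Thus,

$$\delta W_E = \frac{1}{4\pi} \int d\mathbf{r} \, \phi(\mathbf{r}) \, \nabla \cdot \delta\mathbf{E}$$

and, integrating by parts (i.e., dropping a surface integral at $\infty$), we have

$$\delta W_E = \frac{1}{4\pi} \int d\mathbf{r} \, \mathbf{E} \cdot \delta\mathbf{E} = \frac{1}{8\pi} \, \delta \int d\mathbf{r} \, E^2, \tag{1.3.3}$$

all to first order in $\delta\rho$ and $\delta\mathbf{E}$.
Equation (1.3.3) can be integrated: The total work done is

$$W_E = \frac{1}{8\pi} \int d\mathbf{r} \, E_1^2 - \frac{1}{8\pi} \int d\mathbf{r} \, E_0^2, \tag{1.3.4}$$

where $\mathbf{E}_1$ is the field after the work has been done, $\mathbf{E}_0$ before.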

If the initial charge configuration is a uniformly distributed finite
charge over a very large volume, $\int d\mathbf{r} \, E_0^2/8\pi$ goes to zero.
If, however, we are bringing together small clumps of charge, then
$\int d\mathbf{r} \, E_0^2/8\pi$ will be different from zero for each clump and must be
subtracted in the above formula.

Assuming the first case, we can write

$$W_E = \frac{1}{8\pi} \int d\mathbf{r} \, E^2 = \frac{1}{2} \int d\mathbf{r} \, \phi(\mathbf{r}) \, \rho(\mathbf{r}) \tag{1.3.5}$$

or

$$W_E = \frac{1}{2} \int d\mathbf{r} \, d\mathbf{r}' \, \rho(\mathbf{r}) \, \frac{1}{|\mathbf{r} - \mathbf{r}'|} \, \rho(\mathbf{r}') \tag{1.3.6}$$

for the work done in assembling the charge density $\rho$. Going to the limit
of point charges (i.e., charges with radii small compared to the distance
between them) we find that

$$W_E = \frac{1}{2} \sum_{i \neq j} \frac{q_i q_j}{|\mathbf{r}_i - \mathbf{r}_j|} \tag{1.3.7}$$

is the work done in bringing all the charges $q_i$ from $r = \infty$ to $\mathbf{r}_i$. [The
missing terms with $i = j$ are left out because they would have been included
in the initial energy of the separated charges. Of course, the point charge
approximation could not be made for such terms, since the integral (1.3.6)
would be infinite.]
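
The double sum (1.3.7) is straightforward to evaluate for a handful of point charges. The sketch below, with hypothetical function names and an arbitrary three-charge system, also writes the same energy as a sum over distinct pairs, which makes the role of the factor $\frac{1}{2}$ explicit.

```python
import math
from itertools import combinations

def coulomb_energy(charges):
    """W_E = (1/2) * sum over i != j of q_i q_j / |r_i - r_j|, as in (1.3.7)."""
    total = 0.0
    for i, (qi, ri) in enumerate(charges):
        for j, (qj, rj) in enumerate(charges):
            if i != j:
                total += 0.5 * qi * qj / math.dist(ri, rj)
    return total

def pair_sum_energy(charges):
    """The same energy written as a sum over distinct pairs, each counted once."""
    return sum(qi * qj / math.dist(ri, rj)
               for (qi, ri), (qj, rj) in combinations(charges, 2))

# Three charges on the corners of a 3-4-5 right triangle (esu and cm).
system = [(1.0, (0.0, 0.0, 0.0)), (2.0, (3.0, 0.0, 0.0)), (-1.0, (0.0, 4.0, 0.0))]
w_double = coulomb_energy(system)
w_pairs = pair_sum_energy(system)
```

For this system the three pair terms are $2/3$, $-1/4$, and $-2/5$, so both functions should return $1/60$ erg.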

The electrostatic energy $W_E$ in (1.3.7) has the property that, together
with the kinetic energy of the charges $q_i$,

$$T = \sum_i \frac{m_i v_i^2}{2}, \tag{1.3.8}$$

it is conserved. That is,

$$\frac{d}{dt}(T + W_E) = 0 \tag{1.3.9}$$

provided the forces on the charges are purely electrostatic and given by
Coulomb's law.

We shall see later that a similar calculation can be made for a static
(really, a slowly changing) magnetic field:

$$W_M = \frac{1}{8\pi} \int_{\text{all space}} d\mathbf{r} \, B^2(\mathbf{r}). \tag{1.3.10}$$

Although (1.3.9) and (1.3.10) will have been derived for slowly
changing fields, it turns out remarkably, as we shall see later, that the
conservation law

$$\frac{d}{dt}(T + W_E + W_M) = 0 \tag{1.3.11}$$

still holds for rapidly changing fields. This appears to be a lucky accident,
since it holds for electrodynamics, but does not hold for other field
theories, in which an explicit interaction term appears in the conservation law.
An example is discussed in Section 7.4.

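We turn next to stress in the electrostatic field. We calculate the total
electrical force on the charge inside a surface $S$. Introducing the
summation convention we have

$$F_i = \int d\mathbf{r} \, \rho(\mathbf{r}) \, E_i(\mathbf{r}) \tag{1.3.12}$$

$$= \frac{1}{4\pi} \int d\mathbf{r} \, E_i(\mathbf{r}) \, \nabla \cdot \mathbf{E}(\mathbf{r}) = \frac{1}{4\pi} \int d\mathbf{r} \, E_i \frac{\partial E_j}{\partial x_j} \tag{1.3.13}$$

$$= \frac{1}{4\pi} \int d\mathbf{r} \left[ \frac{\partial}{\partial x_j}(E_i E_j) - E_j \frac{\partial E_i}{\partial x_j} \right]. \tag{1.3.14}$$

But,

$$(\nabla \times \mathbf{E})_i = \epsilon_{ijk} \frac{\partial}{\partial x_j} E_k \tag{1.3.15}$$

$$= \frac{1}{2} \epsilon_{ijk} \left( \frac{\partial}{\partial x_j} E_k - \frac{\partial}{\partial x_k} E_j \right). \tag{1.3.16}$$

Therefore, since $\nabla \times \mathbf{E} = 0$, $\partial E_i/\partial x_j = \partial E_j/\partial x_i$ and (1.3.14) becomes

$$F_i = \frac{1}{4\pi} \int d\mathbf{r} \left[ \frac{\partial}{\partial x_j}(E_i E_j) - \frac{\partial}{\partial x_i} \frac{E^2}{2} \right] = \frac{1}{4\pi} \int d\mathbf{r} \, \frac{\partial}{\partial x_j} \left( E_i E_j - \delta_{ij} \frac{E^2}{2} \right). \tag{1.3.17}$$

Gauss’s theorem leads to

$$F_i = \int dS_j \, T_{ij}, \tag{1.3.18}$$

where $T_{ij}$, the Maxwell stress tensor, is given by

$$T_{ij} = \frac{1}{4\pi} \left( E_i E_j - \delta_{ij} \frac{E^2}{2} \right). \tag{1.3.19}$$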


Equation (1.3.18) tells us that the force on charges inside an arbitrary
surface $S$ may be thought of as coming from a stress through that surface,
where $-T_{ij}$ is the $i$th component of the force transmitted in the $j$ direction
per unit area into the surface. The minus sign exists because $dS_j$ in (1.3.18)
is the outward normal.

A simple example: Two charges are shown in Figure (1.1). If both
are positive, as in Figure (1.1a), the normal component of the field $E_n$ at
the surface equidistant from the two charges is zero, so that the first term
in $T_{ij}$ gives zero force through that surface; the second term is negative
and, hence, corresponds to a force into the surface, and hence a repulsion.
This is as if the lines of force repel each other.

For one positive and one negative charge, as in Figure (1.1b), the
situation is different: The parallel component of the field at the surface
vanishes, $E_\parallel = 0$, while $E_n \neq 0$. Hence, the first term in $T_{ij}$ is twice the
second term, and the sign of the force changes, corresponding to an
attraction. The lines of force are under tension along their length.

Figure 1.1. (a) Two positive charges. (b) One positive and one negative charge.

Note that there is no contradiction between a right pointing force on
the object on the right and simultaneously a left pointing force on an
object on the left. This is, in fact, demanded by Newton’s third law.
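
The two-charge example can be checked numerically by integrating the stress (1.3.19) over the plane midway between the charges. The sketch below places the charges at $z = \pm a$ and closes the surface with a hemisphere at infinity, whose contribution is assumed to vanish; the function names, the cutoff radius, and the step count are our own choices for illustration.

```python
import math

def field(point, charges):
    """Superposed Coulomb field of point charges (Gaussian units)."""
    e = [0.0, 0.0, 0.0]
    for q, c in charges:
        d = [point[k] - c[k] for k in range(3)]
        dist3 = (d[0] ** 2 + d[1] ** 2 + d[2] ** 2) ** 1.5
        for k in range(3):
            e[k] += q * d[k] / dist3
    return e

def force_on_upper_charge(charges, s_max=200.0, steps=40000):
    """F_z on the charge in z > 0 from the stress through the plane z = 0.

    On the plane the outward normal of the upper half space is -z, so
    F_z = -(integral of T_zz dS), with T_zz = (E_z^2 - E^2/2) / (4 pi).
    """
    ds = s_max / steps
    total = 0.0
    for n in range(steps):
        s = (n + 0.5) * ds                       # midpoint rule in the radius s
        e = field((s, 0.0, 0.0), charges)        # the field is axially symmetric
        e2 = e[0] ** 2 + e[1] ** 2 + e[2] ** 2
        t_zz = (e[2] ** 2 - 0.5 * e2) / (4.0 * math.pi)
        total -= t_zz * 2.0 * math.pi * s * ds   # ring of area 2 pi s ds
    return total

a, q = 1.0, 1.0
f_like = force_on_upper_charge([(q, (0.0, 0.0, a)), (q, (0.0, 0.0, -a))])
f_unlike = force_on_upper_charge([(q, (0.0, 0.0, a)), (-q, (0.0, 0.0, -a))])
coulomb = q * q / (2.0 * a) ** 2
```

For like charges the result should be close to $+q^2/(2a)^2$ (the upper charge is pushed away), and for unlike charges close to $-q^2/(2a)^2$, in agreement with Coulomb's law (1.1.1) for a separation $2a$.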


1.4. ELECTROSTATICS IN THE 
PRESENCE OF CONDUCTORS: 
SOLVING FOR ELECTROSTATIC CONFIGURATIONS 


The electrostatic field in a conductor must be zero. Otherwise, current
would flow, and we would not be doing electrostatics. Therefore, the
potential difference between two points in or on the conductor must be
zero, since

$$\phi_{21} = -\int_1^2 \mathbf{E} \cdot d\mathbf{l}.$$

Therefore, the surface of the conductor is an equipotential, and the field
at the conducting surface is normal to it. It then follows from Gauss’ law
that the outgoing normal field at the surface, $E_n$, will be given by

$$E_n = 4\pi\sigma, \tag{1.4.1}$$

where $\sigma$ is the surface charge density. Note that there can be no volume
charge density in the conductor, since $\nabla \cdot \mathbf{E} = 0$ there. Of course, $\sigma$ cannot
be chosen arbitrarily for a conducting surface. Only the total charge $Q$ (if
it is insulated) or the potential $\phi$ (if it is connected to a battery) can be so
chosen. The surface charges will adjust themselves to make the conducting
surfaces equipotentials. The basic calculational problem of electrostatics
is to find out how the charges have adjusted themselves and to calculate
the potential (and fields) they generate after doing so.

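We show first that given a charge density $\rho$ and a set of conducting
surfaces $S_i$, with either $Q_i$ or $\phi_i$ known, the electric field is uniquely
determined.

Let $\psi_1$, $\psi_2$ be two presumed different solutions for the potential. Then

$$I = \int_V d\mathbf{r} \, [\nabla(\psi_1 - \psi_2)]^2 = \int_V d\mathbf{r} \, \nabla(\psi_1 - \psi_2) \cdot \nabla(\psi_1 - \psi_2)$$

$$= \sum_i \int_{S_i} d\mathbf{S} \cdot (\psi_1 - \psi_2) \nabla(\psi_1 - \psi_2) - \int_V d\mathbf{r} \, (\psi_1 - \psi_2) \nabla^2(\psi_1 - \psi_2), \tag{1.4.2}$$

where $V$ is the space contained between the conductors. Both terms are
zero. Since $\psi_1 - \psi_2$ is constant over the conducting surface, the first term
is proportional to $\sum_i \Delta Q_i \Delta\phi_i = 0$, where $\Delta Q_i$ and $\Delta\phi_i$ are the differences
in charge and potential on the $i$th conductor between the two solutions; one
of the two vanishes on each conductor, since either $Q_i$ or $\phi_i$ is given. The
second term is zero because both
$\psi_1$ and $\psi_2$ satisfy the Poisson equation $\nabla^2 \psi = -4\pi\rho$ with the same charge
density $\rho$. Therefore, $I$ is zero so that $[\nabla(\psi_1 - \psi_2)]^2$ is zero, and $\psi_1$ and
$\psi_2$ differ at most by a constant. Thus, the electric field is uniquely
determined by the boundary conditions and Poisson’s equation. Note that
if any set of conductors is joined by batteries, with given potential
differences between them and given total charge shared among them, the
expression $\sum_i \Delta Q_i \Delta\phi_i$ is still zero.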

The general electrostatic problem can therefore be formulated as
follows: Given a set of conducting surfaces, the (appropriately specified) 
potentials and charges on the surfaces, and a given fixed charge distribu- 
tion p(r) in the space outside of the conducting surfaces, find the potential 
everywhere. 

There is no general method for solving this problem. For certain 
geometries, however, there are available specific methods, with which we 
assume the reader is familiar. These include the method of images, the 
use of special coordinate systems appropriate to the geometry, and the use 
of analytic functions of a complex variable for two-dimensional problems. 
Examples of all these are given in the problems at the end of the chapter. 

We wish to take up briefly two very general methods that are of use 
in many areas of physics. These are, first, the method of Green’s functions 
and, second, the use of variational principles. 

Green’s functions make it possible to reduce to quadratures a class of 
problems with given potentials or charges on conducting surfaces, and 
arbitrary spatial charge distribution. The formulation is as follows: given
potentials $\phi_c$ on conducting surfaces $S_c$ and total charges $Q_b$ on conducting
surfaces $S_b$.⁴ The Green’s function $G(\mathbf{r},\mathbf{r}_1)$ is the potential produced by
a unit point charge at $\mathbf{r}_1$, with zero potential on the $S_c$’s and zero charge
on the $S_b$’s. The potential is referred to zero at infinity. Thus, $\nabla^2 G(\mathbf{r},\mathbf{r}_1) =
-4\pi\delta(\mathbf{r}-\mathbf{r}_1)$, $G(\mathbf{r},\mathbf{r}_1) = 0$ with $\mathbf{r}$ on each $S_c$, and $G$ is constant on each $S_b$
with $\int d\mathbf{S}_b\cdot\nabla G(\mathbf{r},\mathbf{r}_1) = 0$. Let $\psi$ be the actual potential for given $\rho$, $\phi_c$,
and $Q_b$. Consider


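$$I(\mathbf{r}_1) = \sum_i \int [G(\mathbf{r},\mathbf{r}_1)\,\nabla\psi(\mathbf{r}) - \nabla G(\mathbf{r},\mathbf{r}_1)\,\psi(\mathbf{r})]\cdot d\mathbf{S}_i \tag{1.4.3}$$


with $d\mathbf{S}_i$ along the inward normal to each conducting surface. On $S_c$, the first
integral vanishes, and the second is

$$-4\pi\phi_c\,Q_c(\mathbf{r}_1)$$

where $Q_c(\mathbf{r}_1)$ is the charge on $S_c$ for the Green’s function boundary
condition. On $S_b$, the second integral vanishes, and the first is
$4\pi Q_b\,\phi_b(\mathbf{r}_1)$, where $\phi_b(\mathbf{r}_1)$ is the potential on $S_b$ for the Green’s function
boundary condition.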

Now use Gauss’ theorem: 


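$$I(\mathbf{r}_1) = -4\pi\int d\mathbf{r}\,\rho(\mathbf{r})\,G(\mathbf{r},\mathbf{r}_1) + 4\pi\int d\mathbf{r}\,\delta(\mathbf{r}-\mathbf{r}_1)\,\psi(\mathbf{r}). \tag{1.4.4}$$

Combining (1.4.4) and (1.4.3), we have

$$\psi(\mathbf{r}_1) = \int d\mathbf{r}\,\rho(\mathbf{r})\,G(\mathbf{r},\mathbf{r}_1) + \frac{I(\mathbf{r}_1)}{4\pi} \tag{1.4.5}$$


so that $\psi$ is given by integrals over presumed known functions.
We show now that the Green’s function is symmetric: $G(\mathbf{r},\mathbf{r}_1) =
G(\mathbf{r}_1,\mathbf{r})$. Consider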


$$I(\mathbf{r}_1,\mathbf{r}_2) = \sum_i \int d\mathbf{S}_i\cdot[G(\mathbf{r}_i,\mathbf{r}_1)\,\nabla G(\mathbf{r}_i,\mathbf{r}_2) - G(\mathbf{r}_i,\mathbf{r}_2)\,\nabla G(\mathbf{r}_i,\mathbf{r}_1)]. \tag{1.4.6}$$


⁴Note that given $\phi_i$ corresponds to Dirichlet boundary conditions, but given $Q_i$ does
not correspond to Neumann boundary conditions, since only the total charge on a surface
is given. Nevertheless, given $Q$ is the physically interesting case and by the uniqueness
theorem determines the solution and the Green’s function.


Clearly, $I = 0$, since on each surface $G$ is constant, and either zero or
such that $\int \nabla G(\mathbf{r},\mathbf{r}')\cdot d\mathbf{S} = 0$. So, using Gauss’ theorem, we obtain

$$\begin{aligned}
0 &= \int d\mathbf{r}\,[\nabla^2 G(\mathbf{r},\mathbf{r}_1)\,G(\mathbf{r},\mathbf{r}_2) - \nabla^2 G(\mathbf{r},\mathbf{r}_2)\,G(\mathbf{r},\mathbf{r}_1)] \\
  &= 4\pi[G(\mathbf{r}_2,\mathbf{r}_1) - G(\mathbf{r}_1,\mathbf{r}_2)].
\end{aligned} \tag{1.4.7}$$
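The symmetry of the Green’s function can be checked numerically in a case where $G$ is known in closed form. The sketch below assumes a single grounded sphere of radius $R$ centered at the origin and uses the standard image-charge construction for a unit charge outside it; the function name and the sample points are illustrative choices, not part of the text.

```python
import math

def green_grounded_sphere(r, r1, R=1.0):
    """Potential at r due to a unit point charge at r1 outside a grounded
    sphere of radius R centered at the origin (Gaussian units).
    Built from the method of images: image charge -R/|r1| at R^2 r1/|r1|^2."""
    d1 = math.sqrt(sum(c * c for c in r1))            # distance of the source from the center
    image_pos = [c * R**2 / d1**2 for c in r1]        # location of the image charge
    image_charge = -R / d1                            # strength of the image charge
    return 1.0 / math.dist(r, r1) + image_charge / math.dist(r, image_pos)

# Two arbitrary points outside the unit sphere (illustrative values).
r_a = (1.5, 0.2, -0.7)
r_b = (-0.4, 2.0, 1.1)
on_sphere = (0.6, 0.0, 0.8)                           # a point with |r| = R = 1

print("G(r_a, r_b) =", green_grounded_sphere(r_a, r_b))
print("G(r_b, r_a) =", green_grounded_sphere(r_b, r_a))
print("G on the sphere =", green_grounded_sphere(on_sphere, r_b))
```

The two printed values of $G$ should agree to rounding error, and $G$ should vanish on the sphere, as the Dirichlet boundary condition requires.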


Although there is no general exact method for finding the field in the 
presence of a given configuration of conductors and charges, there is an 
exact variational principle that applies to a general electrostatic problem 
and can be used to generate approximate solutions. 

Suppose we have a given set of conductors, with label $c$, on which the
potential $\phi_c$ is given, another set of conductors, with label $b$, on which
the charge $Q_b$ is given, and a given spatial charge density $\rho(\mathbf{r})$. Then, as
we have shown, the field $\mathbf{E}(\mathbf{r})$ is determined, as is the potential $\phi(\mathbf{r})$ to
within a constant. The variational principle we consider here is for the
quantity⁵

$$I = \frac{1}{2}\int d\mathbf{r}\,(\nabla\psi)^2 - 4\pi\int d\mathbf{r}\,\rho\,\psi - 4\pi\sum_b Q_b\,\psi(b) \tag{1.4.8}$$


and states that $I$ is an absolute minimum when the variational function
$\psi(\mathbf{r})$ equals the correct potential everywhere. In the variation of $\psi$ about
the minimum, it must take on the assigned values $\phi_c$ on the $c$ conductors.
In (1.4.8), the function $\psi(b)$ signifies the constant value of the function
$\psi(\mathbf{r})$ on the surface of conductor $b$ (where the potential $\phi_b$ is not given).
To prove the principle, we let

$$\psi = \phi + \delta\psi \tag{1.4.9}$$

where $\phi$ is the exact solution of the electrostatic problem. Then

$$\begin{aligned}
I(\psi) = I(\phi) &+ \int d\mathbf{r}\,\nabla\phi\cdot\nabla\delta\psi - 4\pi\int d\mathbf{r}\,\rho\,\delta\psi \\
 &- 4\pi\sum_b Q_b\,\delta\psi(b) + \frac{1}{2}\int d\mathbf{r}\,(\nabla\delta\psi)^2
\end{aligned} \tag{1.4.10}$$


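⁵$I$ is called a functional of $\psi$. A functional is a number whose value depends on a
function. We shall encounter this concept often.

and, after a partial integration,

$$\begin{aligned}
I(\psi) - I(\phi) = &\sum_b \int \nabla\phi\,\delta\psi(b)\cdot d\mathbf{S}_b - \int d\mathbf{r}\,\delta\psi\,\nabla^2\phi \\
 &- 4\pi\int d\mathbf{r}\,\rho\,\delta\psi - 4\pi\sum_b Q_b\,\delta\psi(b) + \frac{1}{2}\int d\mathbf{r}\,(\nabla\delta\psi)^2.
\end{aligned} \tag{1.4.11}$$


The surface element $d\mathbf{S}_b$ points into the surface of conductor $b$. Conductors
$c$ do not contribute, since $\delta\psi_c = 0$. Thus,

$$I(\psi) - I(\phi) = \frac{1}{2}\int d\mathbf{r}\,(\nabla\delta\psi)^2, \tag{1.4.12}$$

since $\nabla^2\phi = -4\pi\rho$ and

$$\int \nabla\phi\cdot d\mathbf{S}_b = 4\pi Q_b.$$


Therefore, $I(\phi)$ is an absolute minimum for $\delta\psi = 0$.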
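As an illustration of how the principle generates approximate solutions, consider a minimal sketch under stated assumptions: two concentric spherical conductors of radii $a < b$, the inner held at potential 1 and the outer at potential 0, with no space charge and no conductors of type $b$, so that $I$ reduces to $\frac{1}{2}\int d\mathbf{r}\,(\nabla\psi)^2$. The trial function linear in $r$ and the numerical values are arbitrary illustrative choices.

```python
import math

def functional_I(psi_prime, a, b, n=20000):
    """I = (1/2) * integral over a < r < b of (d psi/dr)^2 * 4 pi r^2 dr,
    evaluated with the midpoint rule for a spherically symmetric psi."""
    h = (b - a) / n
    total = 0.0
    for k in range(n):
        r = a + (k + 0.5) * h
        total += psi_prime(r) ** 2 * r * r
    return 0.5 * 4.0 * math.pi * total * h

a, b = 1.0, 2.0   # inner sphere at potential 1, outer sphere at potential 0

def exact_prime(r):
    # derivative of the exact potential psi = (1/r - 1/b) / (1/a - 1/b)
    return -1.0 / (r * r * (1.0 / a - 1.0 / b))

def linear_prime(r):
    # derivative of the trial function psi = (b - r) / (b - a), which also
    # takes the assigned values on both conductors
    return -1.0 / (b - a)

I_exact = functional_I(exact_prime, a, b)
I_linear = functional_I(linear_prime, a, b)
print("I(exact)  =", I_exact)
print("I(linear) =", I_linear)
```

The trial function gives a larger value of $I$ than the exact potential, as (1.4.12) requires. Since the field energy is $I/4\pi$ here, $I/2\pi$ at unit potential difference is an upper bound on the capacitance $ab/(b-a)$.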


1.5. SYSTEMS OF CONDUCTORS 


Suppose we have a set of conductors, each carrying a charge $Q_i$. The
potential on each conductor will be a linear function of the $Q_j$’s:

$$\phi_i = \sum_j p_{ij}\,Q_j. \tag{1.5.1}$$


This follows from the linearity of the equations for the fields. Thus, a
charge $Q_1$ on conductor 1, with boundary condition $Q = 0$ on the other
conductors, leads to a potential $\phi^{(1)}(\mathbf{r})$ that takes on the value
$\phi_i^{(1)} = p_{i1}Q_1$ on the $i$th conductor. Similarly, a charge $Q_2$ on 2 with $Q =
0$ on the other conductors leads to a potential $\phi^{(2)}(\mathbf{r})$ that takes on the
value $\phi_i^{(2)} = p_{i2}Q_2$ on the $i$th conductor. If both (1) and (2) carry charges
$Q_1$ and $Q_2$, the potential is clearly the sum of these two, since the Laplace
equation and boundary condition are satisfied. The generalization is
(1.5.1). The $p_{ij}$’s are called coefficients of potential.
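The energy of the configuration can be calculated in two ways:


1. We bring charge $\delta Q_i$ up to the $i$th conductor. The differential work
done is

$$\delta W = \sum_i \phi_i\,\delta Q_i = \sum_{ij} p_{ij}\,Q_j\,\delta Q_i. \tag{1.5.2}$$

2. We know, in general, that

$$W = \frac{1}{2}\sum_i \phi_i\,Q_i = \frac{1}{2}\sum_{ij} p_{ij}\,Q_i\,Q_j, \tag{1.5.3}$$

giving

$$\delta W = \frac{1}{2}\sum_{ij}(Q_i\,\delta Q_j + Q_j\,\delta Q_i)\,p_{ij} = \frac{1}{2}\sum_{ij} Q_j\,\delta Q_i\,(p_{ij} + p_{ji}) \tag{1.5.4}$$


so that (since $Q_j$, $\delta Q_i$ are arbitrary)

$$\frac{p_{ij} + p_{ji}}{2} = p_{ij} \quad\text{or}\quad p_{ij} = p_{ji}$$


and the matrix $p$ is symmetric.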


We have not considered a charge density $\rho(\mathbf{r})$ here. Clearly, one
would take such a charge density into account by first solving the problem
of all neutral conductors with the given charge density $\rho$. Then if $\phi^0(\mathbf{r})$
is that solution, the expression (1.5.1) becomes [with $\phi_i^0 = \phi^0(\mathbf{r}\ \text{on}\ i)$]

$$\phi_i - \phi_i^0 = \sum_j p_{ij}\,Q_j \tag{1.5.5}$$


with the $p_{ij}$ the same as before. $\phi_i - \phi_i^0$ is the potential produced on the
conductor by the charges $Q_j$ alone.

Returning to (1.5.1), we may solve for the $Q$’s as functions of the
$\phi$’s. This is possible, since we know that

$$W = \frac{1}{2}\sum_{ij} Q_i\,p_{ij}\,Q_j > 0$$


unless all $Q$’s are zero. Therefore, $p_{ij}$ has an inverse, $c_{ij}$, such that
$\sum_j p_{ij}\,c_{jk} = \delta_{ik}$ and

$$Q_i = \sum_j c_{ij}\,\phi_j. \tag{1.5.6}$$

The $c_{ij}$ are called coefficients of capacitance. Of course, $c_{ij}$ is also
symmetric.
To calculate the generalized force $F_i$ on the $i$th conductor, we displace
it at constant $Q$:

$$F_i = -\left.\frac{\partial W}{\partial \xi_i}\right|_Q. \tag{1.5.7}$$


The generalized force $F_i$ is defined by the requirement that $-F_i\,\delta\xi_i$ be the
work done in the displacement $\delta\xi_i$. Thus, $F_i$ can be a force or a torque,
depending on whether $\delta\xi_i$ is a translation or a rotation. From (1.5.7) and
(1.5.3), we find

$$F_i = -\frac{1}{2}\sum_{jk}\frac{\partial p_{jk}}{\partial \xi_i}\,Q_j\,Q_k. \tag{1.5.8}$$


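Note: Differentiating at constant $\phi$ would give the wrong answer. In fact,

$$-\left.\frac{\partial W}{\partial \xi_i}\right|_\phi = -\frac{1}{2}\sum_{jk}\phi_j\,\frac{\partial c_{jk}}{\partial \xi_i}\,\phi_k = -F_i. \tag{1.5.9}$$


This follows since, in matrix notation [$\phi$, $Q$ are vectors, $p$, $c$ are
matrices, and $(f, g)$ is a (real) vector inner product], and using
$\partial(pc)/\partial\xi_i = 0$, that is, $c\,(\partial p/\partial\xi_i)\,c = -\partial c/\partial\xi_i$,

$$\begin{aligned}
F_i &= -\frac{1}{2}\left(Q, \frac{\partial p}{\partial \xi_i}Q\right) = -\frac{1}{2}\left(c\phi, \frac{\partial p}{\partial \xi_i}c\phi\right) \\
 &= -\frac{1}{2}\left(\phi, c\,\frac{\partial p}{\partial \xi_i}\,c\,\phi\right) \quad\text{(since $p$, and hence $c$, is symmetric)} \\
 &= \frac{1}{2}\left(\phi, \frac{\partial c}{\partial \xi_i}\phi\right).
\end{aligned} \tag{1.5.10}$$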


The difference in sign comes about because in a displacement keeping the
$\phi$’s constant, charge will flow, and the batteries holding the $\phi$’s constant
will be doing work. It follows that the derivative that gives $F_i$ in (1.5.9)
includes, in $F_i\,\delta\xi_i$, both mechanical and electrical work. A correct account
can be kept. The charge transported to the $i$th conductor is, from (1.5.6),

$$\delta Q_i = \sum_j \delta c_{ij}\,\phi_j \tag{1.5.11}$$

and the work done by the batteries is

$$\delta W_B = \sum_i \delta Q_i\,\phi_i = \sum_{ij} \phi_i\,\delta c_{ij}\,\phi_j. \tag{1.5.12}$$

The total work done at constant $\phi$ is

$$\delta W_T = \frac{1}{2}\sum_{ij} \phi_i\,\delta c_{ij}\,\phi_j = \delta W_B + \delta W_M,$$

where $\delta W_M$ is the mechanical work:

$$\delta W_M = \delta W_T - \delta W_B = -\frac{1}{2}\sum_{ij} \phi_i\,\delta c_{ij}\,\phi_j = \frac{1}{2}\sum_{ij} Q_i\,\delta p_{ij}\,Q_j,$$


in agreement with (1.5.8).
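The sign difference between the derivative at constant $Q$ and at constant $\phi$ can be seen in the simplest example. The sketch below assumes an ideal parallel-plate capacitor in Gaussian units, $C = A/4\pi d$, with edge effects ignored; the plate area, separation, and charge are arbitrary illustrative numbers, and the generalized coordinate is the plate separation $d$.

```python
import math

def capacitance(d, area):
    """Ideal parallel-plate capacitance in Gaussian units, C = A / (4 pi d).
    Edge effects are ignored (an assumption of this sketch)."""
    return area / (4.0 * math.pi * d)

def energy_fixed_charge(d, area, q):
    """W = Q^2 / (2 C), the energy as a function of d with Q held fixed."""
    return q * q / (2.0 * capacitance(d, area))

def energy_fixed_potential(d, area, phi):
    """W = C phi^2 / 2, the energy as a function of d with phi held fixed."""
    return 0.5 * capacitance(d, area) * phi * phi

def minus_derivative(func, x, h=1e-6):
    """Central-difference estimate of -d(func)/dx."""
    return -(func(x + h) - func(x - h)) / (2.0 * h)

area, d, q = 100.0, 0.5, 3.0
phi = q / capacitance(d, area)   # potential difference that goes with q at this d

force_const_q = minus_derivative(lambda x: energy_fixed_charge(x, area, q), d)
wrong_const_phi = minus_derivative(lambda x: energy_fixed_potential(x, area, phi), d)

print("-dW/dd at constant Q  :", force_const_q)     # attractive: negative
print("-dW/dd at constant phi:", wrong_const_phi)   # same magnitude, opposite sign
```

The constant-$Q$ result is the attractive force $-2\pi Q^2/A$; the constant-$\phi$ derivative has the same magnitude and the opposite sign, as in (1.5.9).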

The capacitance of a capacitor can be calculated from either set of
coefficients. A capacitor consists of two conductors carrying equal and
opposite charge. So, with $Q_1 = -Q_2 = Q > 0$, we have

$$\begin{aligned}
\phi_1 &= p_{11}Q_1 + p_{12}Q_2 = (p_{11} - p_{12})\,Q \\
\phi_2 &= p_{21}Q_1 + p_{22}Q_2 = (p_{12} - p_{22})\,Q
\end{aligned}$$

and

$$\phi_1 - \phi_2 = (p_{11} + p_{22} - 2p_{12})\,Q$$

and

$$C\ \text{(the capacitance)} = \frac{Q}{\Delta\phi} = \frac{1}{p_{11} + p_{22} - 2p_{12}}.$$

Note that $C > 0$, since $p_{11} + p_{22} > |2p_{12}|$; otherwise, the energy,

$$W = \frac{1}{2}\,(Q_1^2\,p_{11} + Q_2^2\,p_{22} + 2Q_1Q_2\,p_{12}),$$


could become negative.


1.6. ELECTROSTATIC FIELDS IN MATTER


We wish here to study macroscopic electrostatics in the presence of matter. 
We will make the assumption, following Lorentz, that the macroscopic 
equations we have been using 


$$\nabla\cdot\mathbf{E} = 4\pi\rho \tag{1.1.36}$$

and

$$\nabla\times\mathbf{E} = 0 \tag{1.1.17}$$


hold microscopically, that is, at the atomic level. Thus, our basic equations
are

$$\nabla\cdot\mathbf{e} = 4\pi\rho_m \tag{1.6.1}$$

and

$$\nabla\times\mathbf{e} = 0 \tag{1.6.2}$$


where we use lowercase letters to denote microscopic fields. The symbol
$\rho_m$ stands for microscopic charge density. Evidently, $\mathbf{e}$ and $\rho_m$ will fluctuate
over atomic scale distances. We eliminate these fluctuations by considering
average fields and charge densities, where we average over a region con- 
taining many atoms. We then try to obtain equations for the averaged 
fields. 

A subtle issue arises here: Can a description of the interaction of 
fields and matter that does not make use of quantum mechanics be correct? 
The answer is yes and no. No, obviously, because ordinary matter and 
its atomic constituents cannot be accounted for by the laws of classical 
mechanics. Yes, because in many cases, once the basic structure of the 
system has been determined by quantum mechanics, interactions with 
electric fields can be characterized by a few parameters, in addition to 
macroscopic currents and charges. Examples are the dipole moment per
unit volume $\mathbf{P}$ and the dielectric constant $\epsilon$, which we discuss in the
following; the magnetic dipole moment per unit volume $\mathbf{M}$ and magnetic
permeability $\mu$, which we discuss in Section 2.4; and in addition all of the
above as functions of frequency, which we discuss in Chapter 3 on
time-dependent fields and currents.

A subtler issue has to do with the validity of classical equations for
the electric field. Discussion of this question of course requires the use of
quantum field theory. The emission of a single photon by a single atom
cannot in general be described classically. However, the multiple photon
emission by many atoms, each emitting one photon at a time, and their
subsequent absorption, can in many cases be described classically, even
though the radiation itself is not in a classical state. This is largely a
consequence of the linearity of the field equations. The source of the
radiation, the charge and current densities of the radiating system, must
be correctly described, classically or quantum mechanically as appropriate.
The quantum behavior of matter may be taken into account, either by
cautious phenomenology (the nineteenth century method) or by correct
theory (current condensed matter physics). We will stick mostly to the
nineteenth century way, with the exception of the case of a dilute gas, where
simple quantum mechanical calculations of the dielectric properties can
be carried out.

We proceed by averaging (1.6.1) and (1.6.2) over a region that
contains a large number of atoms, but that is small compared to the scale
of spatial variation of the fields. We average with a smooth function $f(\mathbf{x})$
such that

$$\int d\mathbf{x}\,f(\mathbf{x}) = 1$$


and such that the characteristic size $\lambda$ of $f$ has the two properties

$$\lambda \ll \Lambda \tag{1.6.3}$$

where $\Lambda$ is the scale of distance variation we hope to describe, and

$$n\lambda^3 \gg 1 \tag{1.6.4}$$


where $n$ is the number of atoms per unit volume. $\lambda$ might be defined,
for example, by

$$\lambda^2 = \int d\mathbf{x}\,f(\mathbf{x})\,\mathbf{x}^2.$$


We, of course, choose $f(\mathbf{x})$ to be isotropic, that is, a function of $\mathbf{x}^2$.
A simple model might be $f = 3/(4\pi R^3)$ for $r < R$ and $f = 0$ for $r > R$, with
some smoothing at the boundary. This model evidently gives $\lambda^2 = (3/5)R^2$.
The averages are calculated as

$$E_i(\mathbf{x}) = \int d\mathbf{x}'\,f(\mathbf{x} - \mathbf{x}')\,e_i(\mathbf{x}'), \tag{1.6.5}$$

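etc. This way of averaging has the advantage that it commutes with
differentiation. That is,

$$\begin{aligned}
\frac{\partial E_i}{\partial x_j} &= \int d\mathbf{x}'\,\frac{\partial f(\mathbf{x} - \mathbf{x}')}{\partial x_j}\,e_i(\mathbf{x}') \\
 &= -\int d\mathbf{x}'\,\frac{\partial f(\mathbf{x} - \mathbf{x}')}{\partial x'_j}\,e_i(\mathbf{x}') \\
 &= \int d\mathbf{x}'\,f(\mathbf{x} - \mathbf{x}')\,\frac{\partial e_i(\mathbf{x}')}{\partial x'_j} \\
 &= \overline{\frac{\partial e_i}{\partial x_j}}.
\end{aligned} \tag{1.6.6}$$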
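The statement that averaging commutes with differentiation can be checked numerically. The following one-dimensional sketch is only illustrative: it assumes a Gaussian averaging function of width `s` in place of $f$, and a made-up "microscopic" field consisting of a slowly varying part plus a rapid ripple. It compares the derivative of the average with the average of the derivative at one point.

```python
import math

def kernel(u, s):
    """Normalized 1D Gaussian averaging function of width s (stand-in for f)."""
    return math.exp(-u * u / (2.0 * s * s)) / (s * math.sqrt(2.0 * math.pi))

def average(func, x, s, n=4000):
    """Trapezoid-rule estimate of the integral of kernel(x - x') * func(x') dx'
    over the window |x' - x| < 8 s, outside which the kernel is negligible."""
    lo, hi = x - 8.0 * s, x + 8.0 * s
    h = (hi - lo) / n
    total = 0.5 * (kernel(x - lo, s) * func(lo) + kernel(x - hi, s) * func(hi))
    for k in range(1, n):
        xp = lo + k * h
        total += kernel(x - xp, s) * func(xp)
    return total * h

def micro(x):
    # slowly varying field plus a rapid ripple that mimics atomic-scale structure
    return math.sin(x) + 0.5 * math.sin(40.0 * x)

def micro_prime(x):
    # exact derivative of micro(x)
    return math.cos(x) + 20.0 * math.cos(40.0 * x)

s, x0, step = 0.3, 0.7, 1e-3

derivative_of_average = (average(micro, x0 + step, s)
                         - average(micro, x0 - step, s)) / (2.0 * step)
average_of_derivative = average(micro_prime, x0, s)

print("d/dx of the average   :", derivative_of_average)
print("average of d/dx       :", average_of_derivative)
```

The two numbers agree to within the finite-difference error, and the rapid ripple, whose derivative is large microscopically, contributes essentially nothing to either.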
Thus we find, from (1.6.1) and (1.6.2), for $E_i = \bar{e}_i$:

$$\nabla\times\mathbf{E} = 0 \tag{1.6.7}$$

and

$$\nabla\cdot\mathbf{E} = 4\pi\bar\rho \tag{1.6.8}$$


where $\bar\rho$ is the average charge density.
In order to determine $\bar\rho$, we divide the charge density into two classes:

$$\rho_f = \rho_{\text{free}} \quad\text{and}\quad \rho_b = \rho_{\text{bound}}.$$


One may think of $\rho_f$ as the charge density of charged atomic scale bodies,
such as electrons or ions on the surface of a conductor. However, the
division is not unique. For example, the induced “bound” surface charge
on a dielectric sphere placed in an external field is, for a large dielectric
constant, almost identical to the “true” (or “free”) surface charge induced
on a conducting sphere. (See Problem 1.27.)

Our problem is to find a useful way of expressing the space averages
of $\rho_f$ and $\rho_b$. We call

$$\bar\rho_f = \rho \tag{1.6.9}$$


the macroscopic charge density, which we presume to be independent of
the applied field (except for its distribution on the surface of a conductor).
There remains $\bar\rho_b$. Evidently, different material systems will behave quite
differently, and a separate analysis is really required for each one. In
order to fix our ideas, it is convenient to consider the simplest possible
system: a set of spherically symmetric atoms whose separations from each
other are large compared to their common radius, that is, a dilute gas.
We consider an applied field $E_0$ that is small compared to the internal
fields of the atoms, that is,

$$E_0 \ll \frac{e}{a_B^2} \sim \frac{26\ \text{Volts}}{a_B} \sim 5\times 10^{9}\ \text{Volts/cm} \tag{1.6.10}$$


where $e$ is the electron’s charge and $a_B$ the Bohr radius:

$$a_B = \frac{\hbar^2}{me^2} \sim \frac{1}{2}\times 10^{-8}\ \text{cm}. \tag{1.6.11}$$


Here, $\hbar$ is Planck’s constant divided by $2\pi$ and $m$ is the electron mass.
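The orders of magnitude in (1.6.10) and (1.6.11) are easy to reproduce. The short sketch below evaluates the Bohr radius and the atomic field $e/a_B^2$ in Gaussian units and converts the field to Volts/cm; the numerical constants are rounded standard values supplied here for illustration, not quantities taken from the text.

```python
# Rounded CGS-Gaussian values of the constants (assumed standard values).
HBAR = 1.0546e-27          # erg s
E_CHARGE = 4.8032e-10      # esu (statcoulomb)
M_ELECTRON = 9.1094e-28    # g
STATVOLT_IN_VOLTS = 299.79 # 1 statvolt expressed in volts

# Bohr radius a_B = hbar^2 / (m e^2), in cm.
bohr_radius_cm = HBAR**2 / (M_ELECTRON * E_CHARGE**2)

# Characteristic atomic field e / a_B^2, in statvolt/cm and in Volts/cm.
atomic_field_statvolt_per_cm = E_CHARGE / bohr_radius_cm**2
atomic_field_volt_per_cm = atomic_field_statvolt_per_cm * STATVOLT_IN_VOLTS

print("a_B     =", bohr_radius_cm, "cm")
print("e/a_B^2 =", atomic_field_volt_per_cm, "Volts/cm")
```

Laboratory fields are many orders of magnitude below this atomic scale, which is what justifies the linear approximation used in what follows.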

In view of (1.6.10), the effect of the applied field on the matter will 
be small, and we can confine ourselves to the linear approximation in an 
expansion in powers of the field. There are, of course, systems where the 
required inequality fails to apply, for example, in molecules with large 
permanent dipole moments as discussed in Section 2.4, or in highly excited 
atoms. Since the atoms in our model are far apart, the interaction between 
them will be largely governed by the multipole moments produced by the 
applied field. 

Since the atoms are neutral, the largest effect will come from the
induced electric dipole moment. This moment will be proportional to the
local electric field $\mathbf{E}_l$ at the position of the atom. For our model of widely
separated atoms, we will have approximately $\mathbf{E}_l = \mathbf{E}$, the average electric
field, so that

$$p_i = \alpha E_{l,i} = \alpha E_i. \tag{1.6.12}$$

The polarizability $\alpha$ has the dimensions of a volume; the atomic unit of
volume is $a_B^3$, so that we expect $\alpha$ to be of order $a_B^3$. The field produced
by the polarized atoms will then be of order

$$\delta E \sim \frac{N\alpha E_0}{R^3} \tag{1.6.13}$$

so that

$$\frac{\delta E}{E_0} \sim \frac{N a_B^3}{R^3} \tag{1.6.14}$$


where $N$ is the number of atoms in the sample creating the field $\delta E$, and
$R$ is a mean separation of the atoms. Thus, very crudely,

$$\frac{\delta E}{E_0} \sim \frac{\text{total atomic volume}}{\text{total occupied volume}}; \tag{1.6.15}$$

for a gas, this ratio is $\sim d/d_l$, where $d$ is the gas density and $d_l$ the liquid
or solid density of the same atom. For air at normal temperature and
pressure, the ratio (1.6.15) is about $10^{-3}$, so the vacuum field is not
appreciably perturbed by the presence of the gas, and the effect of atoms
on each other will be small. The local field $\mathbf{E}_l$ that polarizes the individual
atom will be approximately equal to the average field $\mathbf{E}$. We will return
to the question of atomic polarizability in Chapter 3. (See also Problem
1.36.)

A quadrupole moment can also be present; in an isotropic atom,
however, the tensor quadrupole $Q_{ij}$ can only be induced by a tensor field:

$$Q_{ij} = \alpha_Q\left(\frac{\partial E_i}{\partial x_j} + \frac{\partial E_j}{\partial x_i}\right). \tag{1.6.16}$$


The quadrupole polarizability $\alpha_Q$ in (1.6.16) has the dimensionality $L^5$,
so we expect $\alpha_Q$ for an atom to be $\sim a_B^5$. The field generated by the
induced moment, in analogy with (1.6.13), will be

$$E_Q \sim \frac{N\alpha_Q E_0}{R^5} \tag{1.6.17}$$

or

$$\frac{E_Q}{E_0} \sim \frac{N a_B^5}{R^5} \tag{1.6.18}$$

so that

$$\frac{E_Q}{E_0} \sim \frac{\text{total atomic volume}}{\text{total occupied volume}} \times \frac{a_B^2}{R^2}. \tag{1.6.19}$$


This is smaller than the dipole effect by about $10^{-16}$ and is thus completely
negligible. Higher moments clearly make even smaller contributions.

If we modify our model by bringing the atoms closer together, for
example as a dense gas, or a solid or liquid, it will no longer be possible
to ignore the interatomic interactions. It will still be true that the effective
field $\delta E$ generated by the atomic dipoles will be of the order of magnitude
given by (1.6.15); however, the ratio in (1.6.15) can now be of the order
of unity, so that the effect of the atomic polarization will be not only large
but also not simply calculable. (For a quite successful way of estimating
$\mathbf{E}_l$ for a denser system, see Problem 1.36.) We still expect the atomic
polarization to be a linear function of the average field in the neighborhood
of the atom, and we still expect the higher multipole moments to make
negligible contributions. However, in addition to the density dependence,
a significant change will be that the relation between $\mathbf{p}$ and $\mathbf{E}$ may, in
general, be tensorial:

$$p_i = \alpha_{ij}E_j \tag{1.6.20}$$

where the tensor $\alpha_{ij}$ would depend on the symmetry of the material. A
locally isotropic material, or a crystal with cubic symmetry, would revert
to the scalar relation with $\alpha_{ij} = \alpha\,\delta_{ij}$.

Before proceeding, we observe that material not locally isotropic can
also possess electric moments even in the absence of an applied field. A
crystal without reflection symmetry, for example, can have a permanent
electric moment. Such a material is called ferroelectric or pyroelectric. We
can obtain an order-of-magnitude estimate of the field produced outside a
material whose atoms are permanently polarized with a dipole moment
$p_0$. It will be

$$E_p \sim \frac{N p_0}{R^3}$$


where $N$ is the number of atoms in the sample and $R$ a mean distance to
the field point. Following the reasoning used to arrive at (1.6.15), we find

$$E_p \sim \frac{p_0}{a_B^3}.$$


With $p_0 \sim e\,a_B\,\xi$, where $\xi$ is a number of very rough order-of-magnitude
unity, we have, using (1.6.10),

$$E_p \sim \xi\,\frac{e}{a_B^2} \sim 5\times 10^{9}\,\xi\ \text{Volts/cm},$$


which is a very large macroscopic field. This field is reduced by two effects.
First, the parameter $\xi$ turns out to be quite small since the energetics of
the quantum states militates against a large dipole moment. Second, since
the conductivity of the material is never exactly zero, the dipole moment
of the sample tends to be canceled by a migration of electrons to the
surface. Similar reasoning shows that permanent quadrupole moments can
generate macroscopic fields of rough order Volts/cm. Although these
permanent fields are of considerable interest, they do not require further
discussion here, since they play the role of fixed applied fields in our
discussion of electrostatics.

The final result: Only the dipole field is important in most macroscopic
electrostatics. For it, the average potential will be given in terms of the
average dipole moment per unit volume [which we call $\mathbf{P}(\mathbf{r})$] by the dipole
formula

$$\phi = -\int d\mathbf{r}'\,\mathbf{P}(\mathbf{r}')\cdot\nabla\frac{1}{|\mathbf{r} - \mathbf{r}'|}. \tag{1.6.21}$$


To see this unambiguously, we calculate the averaged value of the
potential $\phi_D$ arising from the atomic dipole moments:

$$\phi_D(\mathbf{R}) = \int d\mathbf{r}\,f(\mathbf{R} - \mathbf{r})\sum_i \mathbf{p}_i\cdot\frac{\mathbf{r} - \mathbf{r}_i}{|\mathbf{r} - \mathbf{r}_i|^3} \tag{1.6.22}$$


where we have legitimately used the formula for the field of a point dipole,
since in the averaging process, most of the range of $\mathbf{r}$ will be far from the
dipole $\mathbf{r}_i$.

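Now let $\mathbf{r} - \mathbf{r}_i = \mathbf{R} - \mathbf{R}'$ with $\mathbf{R}'$ the new integration variable. Then
(1.6.22) becomes

$$\phi_D(\mathbf{R}) = \int d\mathbf{R}'\left(\sum_i f(\mathbf{R}' - \mathbf{r}_i)\,\mathbf{p}_i\right)\cdot\frac{\mathbf{R} - \mathbf{R}'}{|\mathbf{R} - \mathbf{R}'|^3}. \tag{1.6.23}$$

We see that $\sum_i f(\mathbf{R}' - \mathbf{r}_i)\,\mathbf{p}_i$ essentially counts all dipoles inside the
averaging distance of $f$, so that $\sum_i f(\mathbf{R}' - \mathbf{r}_i)\,\mathbf{p}_i = \mathbf{P}(\mathbf{R}')$, the dipole moment per unit
volume. Returning to (1.6.21), we see that it can be rewritten

$$\phi = \int d\mathbf{r}'\left[\nabla'\cdot\left(\frac{\mathbf{P}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\right) - \frac{\nabla'\cdot\mathbf{P}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\right]. \tag{1.6.24}$$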


We now have a choice. We clearly will use Gauss’ theorem, but we
may either treat the dielectric boundary as a continuous (but rapid) change
from finite $\mathbf{P}$ to 0, in which case we would have no surface term, and a
potential

$$\phi = -\int d\mathbf{r}'\,\frac{\nabla'\cdot\mathbf{P}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|} \tag{1.6.25}$$

corresponding to an average bound charge density

$$\bar\rho_b = -\nabla\cdot\mathbf{P}, \tag{1.6.26}$$


or we could treat the integral as confined to the dielectric with a sharp
surface $S$, in which case we would have

$$\phi = \oint_S \frac{d\mathbf{S}'\cdot\mathbf{P}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|} + \int_V d\mathbf{r}'\,\frac{(-\nabla'\cdot\mathbf{P}(\mathbf{r}'))}{|\mathbf{r} - \mathbf{r}'|} \tag{1.6.27}$$


corresponding to a $\bar\rho_b = -\nabla\cdot\mathbf{P}$ and a surface charge $\sigma_b = P_{\text{normal}}$ on the
surface. Obviously, if there is a sharp boundary, (1.6.25) must produce
(1.6.27) in the limit. This clearly comes about because $\nabla\cdot\mathbf{P}$ approximates
a delta function on the surface.
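The sharp-boundary description can be illustrated with the simplest example, a uniformly polarized sphere, for which $\nabla\cdot\mathbf{P} = 0$ inside and all of the bound charge sits on the surface with density $\sigma_b = P\cos\theta$. The sketch below is illustrative (the values of $P$ and $R$ are arbitrary): it integrates this surface charge numerically and checks that the total bound charge vanishes and that the dipole moment equals $\mathbf{P}$ times the volume of the sphere.

```python
import math

def surface_charge_moments(P, R, n=20000):
    """Integrate sigma_b = P cos(theta) over a sphere of radius R using the
    midpoint rule in theta. Returns (total bound surface charge,
    z-component of the dipole moment of that surface charge)."""
    h = math.pi / n
    charge = 0.0
    dipole = 0.0
    for k in range(n):
        theta = (k + 0.5) * h
        sigma = P * math.cos(theta)                              # sigma_b = P . n
        ring_area = 2.0 * math.pi * R * R * math.sin(theta) * h  # area of the ring at theta
        charge += sigma * ring_area
        dipole += sigma * ring_area * R * math.cos(theta)        # z = R cos(theta)
    return charge, dipole

P, R = 2.0, 1.5
total_charge, dipole_moment = surface_charge_moments(P, R)

print("total bound charge:", total_charge)
print("dipole moment     :", dipole_moment)
print("P * volume        :", P * 4.0 * math.pi * R**3 / 3.0)
```

The dipole moment of the surface charge reproduces $P$ times the volume, consistent with $\mathbf{P}$ being the dipole moment per unit volume.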

We can now return to our original averaged equations. We call $\bar\rho_f =
\rho$, $\bar\rho_b = -\nabla\cdot\mathbf{P}$, and find, from (1.6.26), (1.6.8), and (1.6.7),

$$\nabla\times\mathbf{E} = 0 \tag{1.6.7}$$

and

$$\nabla\cdot\mathbf{E} = 4\pi\rho - 4\pi\nabla\cdot\mathbf{P}. \tag{1.6.28}$$


We are moved to define an electric displacement

$$\mathbf{D} = \mathbf{E} + 4\pi\mathbf{P} \tag{1.6.29}$$

which satisfies the equation

$$\nabla\cdot\mathbf{D} = 4\pi\rho. \tag{1.6.30}$$


Equations (1.6.7) and (1.6.30) determine how we should treat a sharp
boundary. From (1.6.7) we find, using Stokes’ theorem on the rectangle
shown in Figure 1.2a, and $\Delta x/\Delta l \to 0$, that $E_{\text{tangential}}$ must be continuous
across the surface. For the $\mathbf{D}$ boundary condition, we use Gauss’ theorem
and the pillbox as shown in Figure 1.2b, with $\Delta x/\sqrt{\Delta S} \to 0$, to find $\Delta D_n =
4\pi\sigma$, where $\sigma$ is the free surface charge density.

Figure 1.2. (a) Rectangle at the boundary for Stokes’ theorem. (b) Pillbox at the boundary for Gauss’ theorem.


Equations (1.6.7) and (1.6.30) require a relation between $\mathbf{E}$ and $\mathbf{D}$ to
determine the configuration. Since the system is linear, the earlier
arguments of this section show that $\mathbf{P}$ must be proportional to $\mathbf{E}$, that is,

$$P_i = \chi_{ij}E_j \quad\text{(remember the summation convention)} \tag{1.6.31}$$


where for a dilute gas $\chi_{ij} = n\alpha_{ij}$, with $n$ the number of dipoles per unit
volume. Therefore,

$$D_i = \epsilon_{ij}E_j, \qquad \epsilon_{ij} = \delta_{ij} + 4\pi\chi_{ij}, \tag{1.6.32}$$

where a symmetric $\alpha_{ij}$ will produce a symmetric $\epsilon_{ij}$; $\epsilon_{ij}$ is called the
dielectric tensor. An isotropic material, as will be the case here, would
require $\epsilon_{ij} = \epsilon\,\delta_{ij}$ and $\mathbf{D} = \epsilon\mathbf{E}$; $\epsilon$ is called the dielectric constant.


1.7. ENERGY IN A DIELECTRIC MEDIUM 


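From the basic equations (1.6.7) and (1.6.30), we learn that there exists
a potential $\phi$ such that $\mathbf{E} = -\nabla\phi$. Therefore, the work done in bringing
an infinitesimal charge $\delta q$ to a point $\mathbf{r}$ is

$$\delta W = \delta q\,\phi(\mathbf{r}), \tag{1.7.1}$$


where we have set $\phi(\infty) = 0$, as usual. For a distributed charge,

$$\begin{aligned}
\delta W &= \int d\mathbf{r}\,\delta\rho(\mathbf{r})\,\phi(\mathbf{r}) \\
 &= \frac{1}{4\pi}\int d\mathbf{r}\,\delta(\nabla\cdot\mathbf{D})\,\phi(\mathbf{r}) \\
 &= -\frac{1}{4\pi}\int d\mathbf{r}\,\delta\mathbf{D}\cdot\nabla\phi
\end{aligned} \tag{1.7.2}$$

or

$$\delta W = \frac{1}{4\pi}\int d\mathbf{r}\,\delta\mathbf{D}\cdot\mathbf{E}. \tag{1.7.3}$$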
4a 


In the integration by parts, we have ignored surface terms on
conducting surfaces. These are implicitly included in (1.7.2); if made
explicit, the surface term in the Gauss’ theorem integration by parts
would have canceled the $\delta Q\,\phi$ surface contribution to $\delta W$, so that (1.7.3)
continues to hold in the presence of conductors.

We consider here only linear media, for which

$$D_i = \epsilon_{ij}E_j.$$

Then

$$\delta W = \frac{1}{4\pi}\int d\mathbf{r}\,\epsilon_{ij}\,\delta E_j\,E_i \tag{1.7.4}$$

$$= \frac{1}{4\pi}\int d\mathbf{r}\,(\epsilon^S_{ij} + \epsilon^A_{ij})\,\delta E_j\,E_i \tag{1.7.5}$$


where $\epsilon^S_{ij} = (\epsilon_{ij} + \epsilon_{ji})/2$ is the symmetric part of $\epsilon_{ij}$; $\epsilon^A_{ij} = (\epsilon_{ij} - \epsilon_{ji})/2$ is
the antisymmetric part.
The first term in (1.7.5) can be integrated since

$$\epsilon^S_{ij}\,\delta E_j\,E_i = \frac{1}{2}\,\epsilon^S_{ij}\,\delta(E_iE_j). \tag{1.7.6}$$


However, the integral of $\delta W^A = (1/4\pi)\,\epsilon^A_{ij}\,\delta E_j\,E_i$ depends on the path of
integration. In particular, a closed path in $\mathbf{E}$ space will not, in general,
integrate to zero. Take a path in the $E_x,E_y$-plane. Then by Stokes’ theorem

$$w^A = \frac{\epsilon^A_{xy}}{4\pi}\oint (E_x\,dE_y - E_y\,dE_x) = \frac{\epsilon^A_{xy}}{2\pi}\,(\text{area enclosed by the path}). \tag{1.7.7}$$
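The path dependence is easy to exhibit numerically. The sketch below, which is only illustrative, evaluates the closed-loop sum of $\epsilon_{ij}E_i\,dE_j$ around a circle in the $(E_x, E_y)$ plane, leaving out the overall constant prefactor. The two sample tensors are made-up values: one purely symmetric, one purely antisymmetric.

```python
import math

def loop_integral(eps, radius=1.0, n=20000):
    """Sum of eps[i][j] * E_i * dE_j around a circle of the given radius in
    the (E_x, E_y) plane, traversed counterclockwise (midpoint rule).
    The overall constant prefactor of the work is deliberately omitted."""
    total = 0.0
    h = 2.0 * math.pi / n
    for k in range(n):
        t = (k + 0.5) * h
        E = (radius * math.cos(t), radius * math.sin(t))
        dE = (-radius * math.sin(t) * h, radius * math.cos(t) * h)
        total += sum(eps[i][j] * E[i] * dE[j] for i in range(2) for j in range(2))
    return total

symmetric = [[2.0, 0.3],
             [0.3, 1.5]]
antisymmetric = [[0.0, 0.2],
                 [-0.2, 0.0]]

closed_loop_symmetric = loop_integral(symmetric)
closed_loop_antisymmetric = loop_integral(antisymmetric)

print("symmetric part, closed loop     :", closed_loop_symmetric)
print("antisymmetric part, closed loop :", closed_loop_antisymmetric)
print("2 * eps_xy * (enclosed area)    :", 2.0 * 0.2 * math.pi)
```

The symmetric tensor gives zero around the closed loop, so its work is a function of state, while the antisymmetric tensor gives a result proportional to the enclosed area.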
Thus, a static, antisymmetric component in the dielectric tensor
indicates a medium that absorbs or produces energy.⁶ An energy-conserving
system must therefore have a symmetric dielectric tensor, $\epsilon_{ij} = \epsilon_{ji}$, to
which case we restrict ourselves here. The work done in charging the
system is then

$$W = \frac{1}{8\pi}\int d\mathbf{r}\,\mathbf{E}\cdot\mathbf{D} \tag{1.7.8}$$


and may be identified with the electrostatic contribution to the free energy
of the system.

Equation (1.7.8) has a remarkable property. It appears to express the
electrostatic energy

$$W_m = \frac{1}{8\pi}\int d\mathbf{r}\,\mathbf{e}^2 \tag{1.7.9}$$

(where the subscript $m$ indicates that $W_m$ is calculated from the
microscopic field strength) as a quadratic functional of the averaged field
strength and polarization, $\mathbf{E}$ and $\mathbf{P} = (\mathbf{D} - \mathbf{E})/4\pi$; that is, it appears to
ignore fluctuations of the microscopic field.


⁶We will discuss absorption for time-dependent fields in Chapter 3.

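Actually, in the absence of conductors, the quadratic dependence is
only apparent. We can see this by writing the field $\mathbf{E}$ as

$$\mathbf{E} = \mathbf{E}_0 + \mathbf{E}_P \tag{1.7.10}$$


where $\mathbf{E}_0$ is the field produced by the macroscopic charge density $\rho$ and
$\mathbf{E}_P$ the field produced by the polarization charge density $\rho_P = -\nabla\cdot\mathbf{P}$. The
electrostatic energy change produced by introducing the dielectric medium
is

$$\Delta W = \frac{1}{8\pi}\int d\mathbf{r}\,(\mathbf{E}\cdot\mathbf{D} - \mathbf{E}_0^2) \tag{1.7.11}$$

$$= \frac{1}{8\pi}\int d\mathbf{r}\,[(\mathbf{E} + \mathbf{E}_0)\cdot(\mathbf{D} - \mathbf{E}_0) - 4\pi\mathbf{P}\cdot\mathbf{E}_0], \tag{1.7.12}$$


where $\mathbf{P}$ is the polarization density. In the absence of conductors, the first
term in (1.7.12) vanishes after an integration by parts, since $\nabla\cdot(\mathbf{D} - \mathbf{E}_0) =
0$ and $(\mathbf{E} + \mathbf{E}_0) = -\nabla(\phi + \phi_0)$. There remains

$$\Delta W = -\frac{1}{2}\int d\mathbf{r}\,\mathbf{P}\cdot\mathbf{E}_0 \tag{1.7.13}$$


which is a linear functional of the microscopic polarization and bilinear
only in the applied field.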

Equation (1.7.13) also holds in the presence of conductors, although 
the proof is more complicated. 

It should be noted that following (1.7.8), we referred to $W$ as the
electrostatic contribution to the free energy (as opposed to internal
energy) of the system. This is because in calculating the work done in
electrifying our system, we kept the dielectric constant, and hence the
matter density and temperature, fixed.

The change in free energy $F$ is defined by

$$\delta F = \delta(U - TS) \tag{1.7.14}$$


where $U$ is the internal energy, $T$ the absolute temperature, and $S$ the
entropy. Therefore,

$$\begin{aligned}
\delta F &= \delta U - T\,\delta S - S\,\delta T \\
 &= \delta W - S\,\delta T
\end{aligned} \tag{1.7.15}$$


where $\delta W$ is the work done on the system.
The work $\delta W$ includes mechanical work, such as the familiar
mechanical work done on a gas (with $P$ here the pressure):

$$\delta W_m = -P\,\delta V \tag{1.7.16}$$

and the electrical work done on a dielectric medium:

$$\delta W_e = \frac{1}{4\pi}\int d\mathbf{r}\,(\mathbf{E}\cdot\delta\mathbf{D} - \mathbf{E}_0\cdot\delta\mathbf{E}_0). \tag{1.7.17}$$


As before, $\mathbf{E}_0$ is the applied field, that is, the field that would be present
in the absence of the dielectric sample. We see thus that at constant $T$
and $V$, $\delta F = \delta W_e$.

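The arguments leading to (1.7.13) also lead to

$$\delta W_e = -\int d\mathbf{r}\,\mathbf{P}\cdot\delta\mathbf{E}_0, \tag{1.7.18}$$


showing that the independent variables in the free energy should be
volume (or density), temperature, and $\mathbf{E}_0$.
We then learn, from (1.7.15), that

$$S = -\left(\frac{\partial F}{\partial T}\right)_{\rho,\mathbf{E}_0} \tag{1.7.19}$$

and, from (1.7.18), that

$$\mathbf{P}(\mathbf{r}) = -\frac{\delta F}{\delta\mathbf{E}_0(\mathbf{r})}, \tag{1.7.20}$$


where (1.7.20) and (1.7.18), in fact, define $\delta F/[\delta\mathbf{E}_0(\mathbf{r})]$ as the functional
derivative of $F$ with respect to $\mathbf{E}_0(\mathbf{r})$. Here $\rho$ is the matter density of the
material.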

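The internal energy is given by

$$U = F - T\left(\frac{\partial F}{\partial T}\right)_{\rho,\mathbf{E}_0}. \tag{1.7.21}$$

The electrical contribution to the internal energy is therefore, from
(1.7.13) for $\Delta W$ and $\Delta W = \Delta F$ in (1.7.21),

$$U_{el} = -\frac{1}{2}\int d\mathbf{r}\,\mathbf{E}_0(\mathbf{r})\cdot\left[\mathbf{P}(\mathbf{r}) - T\,\frac{\partial\mathbf{P}(\mathbf{r})}{\partial T}\right]. \tag{1.7.22}$$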

Forces and stresses may be obtained by calculating the change in $F$
resulting from appropriate displacements. For example, the total force or
torque $F_i$ on a dielectric would be calculated as

$$F_i = -\frac{\partial F}{\partial \xi_i} \tag{1.7.23}$$


where $\xi_i$ is the appropriate conjugate variable: for force, a fixed coordinate
in the dielectric, for example, the center of mass; for torque, an
infinitesimal angle of rotation about a coordinate axis. If there are conductors
present, the arguments of Section 1.5 show that the total charge on each
conductor must be held fixed in (1.7.23).

The calculation of internal stresses in a dielectric is harder, but can
be carried out by considering internal displacements. A straightforward
treatment of stresses in a fluid dielectric is given in Panofsky and Phillips.⁷


CHAPTER 1 PROBLEMS 


Application of Gauss’ and Coulomb’s law to simple systems 


1.1. Consider a spherically symmetric charge distribution, $\rho = \rho(r)$.
Assuming the resultant electric field $\mathbf{E}$ to be radial, $\mathbf{E} = \hat{\mathbf{r}}\,E_r(r)$, show
that $E_r(r)$ is given by

$$E_r(r) = \frac{Q(r)}{r^2}$$

where $Q(r)$ is the charge inside $r$.


1.2. Consider a cylindrically symmetric charge distribution, that is, $d =
d(\rho)$, where $z$, $\rho$, and $\varphi$ are cylindrical coordinates, and $d$ is now
the charge density. Assuming the field to be radial, $\mathbf{E} = \hat{\mathbf{e}}_\rho\,E_\rho(\rho)$,
show that $E_\rho(\rho)$ is given by

$$E_\rho(\rho) = \frac{2}{\rho}\,Q(\rho)$$

where $Q(\rho)$ is the total charge per unit length inside the radius $\rho$.


⁷W. K. H. Panofsky and M. Phillips, Classical Electricity and Magnetism, Reading,
MA: Addison-Wesley, Chap. 6.


1.3. Consider a large, rectangular charged plane, with uniform surface
charge density $\sigma$, lying at $z = 0$. Near the center of the plane and
near $z = 0$, assume by symmetry that the field is in the $z$ direction,
$\mathbf{E} = \hat{\mathbf{e}}_z\,E_z(z)$. Then show that

$$E_z = 2\pi\sigma\,\epsilon(z)$$

where $\epsilon(z) = 1$, $z > 0$; $\epsilon(z) = -1$, $z < 0$.


1.4. Now let the charged plane at $z = 0$ extend from $x = -l_1$ to $x = l_2$
and from $y = -l_3$ to $y = l_3$. Investigate the assumption that at $x = y =
0$, the fields $E_x$ and $E_y$ are zero, and $E_z$ is independent of $x$ and $y$.
Give the conditions on $l_1$, $l_2$, $l_3$, and $z$ under which the result of
Problem 1.3 is approximately correct.


1.5. Do the same for the situation of Problem 1.2, where the cylindrical
distribution runs from $z = -l_1$ to $z = l_2$. Hint: The final azimuthal
integral can be done by going to the complex plane. Let $z = e^{i\varphi}$.


1.6. Prove that a conductor is an electrostatic shield. That is, show that
the (static) field inside an empty hollow conductor is zero.


1.7. From Problem 1.6 it follows that a conductor inside a charged
conducting shield will not become charged even if put in electrical
contact with the outer conductor. This is independent of the shape
of either conductor, but depends, via Gauss’ law, etc., on the inverse
square law. For a slightly different force law, the inner conductor
will normally become charged when connected to a charged outer
conductor. The calculation of that charge is difficult except for
specially shaped conductors. Consider now that both conductors are
thin spheres, concentric, with radii $b$ and $a$, with $b > a$. The outer
conductor $b$ carries a charge $Q$ and is electrically connected to the
inner conductor.

(a) Calculate the charge 6Q induced on the inner conductor if the 

force law is derived from a potential 


where $L \gg b$. Note that this form of potential would result
from a theory with massive photons.

(b) Same as (a) but with


with $|\epsilon| \ll 1$.
(c) Same as (a) and (b) but with 


Ng 
v=4(14 (2) } 
r r 
with $|\epsilon| < 1$ and $-1 < \beta < 1$.


Problems with images 


1.8. Two conducting plates make an angle of 90° with each other. Let
one plate be in the yz-plane, with $y > 0$; the other in the xz-plane,
with $x > 0$; and place a point charge $Q$ at $x = x_0 > 0$, $y = y_0 > 0$,
$z = 0$. This configuration can be solved with images. Where and
what are they?


1.9. Two conducting planes make an angle of 45° with each other. A
point charge $Q$ is placed at a distance $R$ from the vertex, half-way
between the plates. This configuration can be solved with images. Where
and what are they?


1.10. A point charge $Q$ is placed at a distance $L$ from the center of a
grounded conducting sphere of radius $R < L$. Show that the configuration
is solved with an image charge $Q'$ placed at a distance
$R' = R^2/L$ from the center of the sphere. Of course, $R' < R$.
Determine $Q'$ as a function of $Q$, $R$, and $L$.


1.11. The same as Problem 1.10, except with a neutral isolated conducting
circular cylinder, radius $R$, and a line charge $\lambda$ a distance $L$
from the center of the cylinder.


1.12. A point dipole is at a distance $L$ and pointing away from the center
of a grounded sphere of radius $R$. Using the method of images, find

(a) The charge induced on the sphere.

(b) The electric dipole moment (with respect to the center of the
sphere) induced on the sphere.


1.13. Two isolated conducting spheres of radii $R_1$ and $R_2$ are placed a
distance $L$ apart, with $L \gg R_1$ and $L \gg R_2$. A charge $Q$ is placed
on sphere 1. This configuration can be solved by successive
approximations involving images, images of images, etc.

(a) Calculate the lowest order (in $R_{1,2}/L$) contribution to the force
between the spheres. Carry the calculation one step farther,
sufficient to identify the power of $R_{1,2}/L$ of the next
correction.

(b) If you calculated the force by setting it equal to the force
between the image charges, your calculation was correct. Can
you justify this statement? Although the answer “no” may be
correct (i.e., you cannot justify this statement), it is not
acceptable.


Problems involving spherical harmonics 


1.14. An isolated conducting sphere is placed in a uniform electric field
$E_0$ in the $z$ direction, so that the applied potential is

$$\phi_0 = -E_0 z = -E_0 r\cos\theta = -E_0 r P_1(\cos\theta).$$

Find the potential $\delta\phi$ generated by the induced charge distribution
on the surface of the sphere. From it, find the charge distribution
itself and the dipole moment of the sphere. Assume that the sources
of $E_0$ are far enough away so that they are unaffected by the
introduction of the sphere.


1.15. (a) Suppose the applied field vanishes at the center of the sphere,
with


E, = Ey:~, E,=-—, E,=-—= 
L 


Find the potential $\phi_0(\mathbf{r})$ that corresponds to this field, the
potential $\delta\phi$ generated by the induced charge distribution on the
sphere, the induced charge distribution itself, and the total
charge, dipole moment, and quadrupole moment of the sphere.

(b) How would you arrange a charge distribution to create this
applied field near the sphere?


1.16. A nonconducting, very thin shell of radius $b$ carries a surface charge
distribution
1, 
o = a0(cos 6 + a 8 cos o). 


Find the electrostatic potential for r >b and r< b, assuming the 
field to be finite everywhere and no other charge to be present. 


1.17. Solve the problem of a point charge and a grounded conducting
sphere by expanding the Coulomb potential of the point charge in
spherical harmonics with respect to the center of the sphere, solving
for the potential for each $l$ and resumming.


1.18. Show that a single Maxwell quadrupole, characterized by two unit
vectors $\hat{n}_1$, $\hat{n}_2$ and a magnitude $Q$, can produce the correct
asymptotic potential of an arbitrarily confined $l = 2$ charge
distribution. Clearly, the plane of $\hat{n}_1$ and $\hat{n}_2$ must contain
two of the principal axes of the quadrupole. However, you must choose the
right two.


Problems in two dimensions involving the use of analytic functions 


(Prerequisite: Some knowledge of complex function theory.) 


1.19. When the potential is independent of one rectangular coordinate,
the Laplace equation for the potential $\phi$ becomes

$$\frac{\partial^2\phi}{\partial x^2} + \frac{\partial^2\phi}{\partial y^2} = 0.$$

The real and imaginary parts of any analytic function satisfy this
equation, as we now show.

(a) An analytic function of a complex variable $z$,

$$w = f(z),$$

has a derivative

$$\frac{dw}{dz} = \lim_{\Delta z \to 0}\frac{f(z + \Delta z) - f(z)}{\Delta z}$$

which is independent of the path by which $\Delta z \to 0$. With
$z = x + iy$ and $w = u + iv$, show that this path independence
implies the Cauchy-Riemann equations

$$\frac{\partial u}{\partial x} = \frac{\partial v}{\partial y}, \qquad \frac{\partial u}{\partial y} = -\frac{\partial v}{\partial x}.$$

(b) Show that the Cauchy-Riemann equations imply that $u$ and $v$
are harmonic.

(c) It is conventional to choose the imaginary part of $w$, $v$, to be
the potential.8 With this choice, show that the lines of force
are lines of constant $u$.


8One could equally well choose $u$, the real part of $w$, as the potential.


1.20. Given one or more two-dimensional conductors (i.e., three-dimensional
cylinders), one looks for a function $w = f(z)$ such that the
cylinders are lines of constant $v$. The potential problem is thus
automatically solved. Show then that the charge density on the
conductor is $\frac{1}{4\pi}\,\partial u/\partial l$, where $dl$ is the
length variable along the conductor. To illustrate this technique,
consider the function


ae 
atei =o 


defined to be the positive square root for $x > 0$ when $y$ approaches
zero from above and analytic in the cut plane (cut from $y = 0$, $x = 0$
to $y = 0$, $x = \infty$).

(a) Find the equation for equipotential surfaces.

(b) The equipotential $v = 0$ is obtained by letting $y \to 0$ from above.
For $0 < y < x$, give $v$ as a function of $x$ and $y$ and find the charge
density on the upper surface of the conductor as a function of $x$.

(c) The value of $w$ for $y < 0$ is found by analytically continuing
around the singularity. Use this procedure to calculate the
charge density under the plate.

Note that although the charge density is singular as $x \to 0$, the
integrated charge in a finite region of $x$ is finite. Also, since the
total charge is infinite (in three dimensions), there is no uniqueness
here. In practice, the conditions that determine $a$ and $b$ will come
from boundary conditions at $\infty$. (See Problems 1.22 and 1.23.)


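1.21. The potential of a line charge $\lambda$ is $-2\lambda\log\rho$, where
$\rho$ is the cylindrical radius. If the line charge is at position
$\boldsymbol{\rho}'$, the potential at $\boldsymbol{\rho}$ is

$$\phi(\boldsymbol{\rho}) = -2\lambda\log|\boldsymbol{\rho} - \boldsymbol{\rho}'|.$$

If the charge is distributed with a density $\sigma(\boldsymbol{\rho}')$,
the resultant potential will be

$$\phi(\boldsymbol{\rho}) = -2\int d\boldsymbol{\rho}'\,\sigma(\boldsymbol{\rho}')\log|\boldsymbol{\rho} - \boldsymbol{\rho}'|,$$

where the area element $d\boldsymbol{\rho}' = \rho'\,d\rho'\,d\varphi'$, and
$|\boldsymbol{\rho} - \boldsymbol{\rho}'| = (\rho^2 + \rho'^2 - 2\rho\rho'\cos(\varphi - \varphi'))^{1/2}$.
Show that the two-dimensional potential has a “multipole” expansion, for
$\rho$ outside of the charge distribution,

$$\phi(\boldsymbol{\rho}) = 2\lambda\log\frac{\rho_0}{\rho} + \sum_{m=1}^{\infty}\frac{2}{m\rho^m}\left[c_m\cos m\varphi + d_m\sin m\varphi\right]$$

where $\rho_0$ is an arbitrary length scale, $\lambda$ the total line charge,

$$\lambda = \int d\boldsymbol{\rho}'\,\sigma(\boldsymbol{\rho}'),$$

and

$$c_m = \int d\boldsymbol{\rho}'\,(\rho')^m\cos m\varphi'\,\sigma(\boldsymbol{\rho}'), \qquad d_m = \int d\boldsymbol{\rho}'\,(\rho')^m\sin m\varphi'\,\sigma(\boldsymbol{\rho}').$$

You should make use of the identity

$$|z - z'| = |\rho^2 + \rho'^2 - 2\rho\rho'\cos(\varphi - \varphi')|^{1/2}$$

where $z$ is the complex variable $z = \rho e^{i\varphi}$, and
$\operatorname{Re}\log(z - z') = \log|z - z'|$.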


1.22. Consider the function


v-o) 


defined to be real and positive for $|x| < b$ and $y \to 0$ from above,
and analytic in the cut plane from $y = 0$, $x = -b$ to $y = 0$, $x = b$.

(a) Find the value of $w$ for $x = b + \epsilon$, $y = 0$ and
$x = -b - \epsilon$, $y = 0$.

(b) Find the value of $w$ as $x \to \pm\infty$. From this, find the physical
system represented here.


1.23. Consider the analytic function $w(z)$ given implicitly by
1 =E 
TEME E “™) 


(clearly, dimensional coordinates would scale z by a length unit and 

w by a potential unit). Let w = u + iv, with v as the potential. 

(a) What is the line v = 0 in the x, y plane? 

(b) What is the line v = 1 in the x, y plane? 

(c) What is the electrostatic problem solved by this function? 

(d) Sketch the equipotential lines with $v = 1 - \epsilon$ and
$v = \epsilon$, when $\epsilon$ is very small.

(e) Give the potential and charge densities on both sides of the 
conducting surfaces as x — ~. 

(f) Near $x = y = 0$, the relation between $w$ and $z$ is
$w = \alpha\sqrt{z}$, where $\alpha$ is a constant. Relate the constant
$\alpha$ to the potential difference and distance between the plates of
this capacitor.


Problems involving dielectrics


1.24. Formulate and prove the uniqueness theorem for conductors and
dielectrics with $\epsilon(\mathbf{x}) > 0$.


1.25. Formulate and prove a variational principle for dielectrics and
conductors analogous to (1.4.8).


1.26. A point charge $q$ in vacuum is a distance $x = l$ from the plane
surface of a dielectric extending to $x = -\infty$. The dielectric constant
is a constant, $\epsilon$. The fields for this configuration of charges can
be found using the method of images: The potential outside the dielectric is
given by the Coulomb potential of the charge $q$ plus the Coulomb
potential of an image charge $q'$ in the dielectric. The potential in
the dielectric is given by the Coulomb potential of an image charge
$q''$ outside of the dielectric. Find the location and values of the two
image charges. Calculate the force on the dielectric as the Coulomb
force between $q$ and $q'$; then verify that this is correct by integrating
the stress tensor over an appropriate surface. Is there another reason
for believing the result?


1.27. A dielectric sphere with dielectric constant $\epsilon$ is placed in a
uniform electric field $E_0$ (whose source is far enough away to be
unaffected by the sphere). Find the electric field outside the sphere and
the induced electric dipole moment of the sphere. Compare the result
for large $\epsilon$ to that for a conducting sphere.


1.28. The problem of a point charge $q$ a vector distance $\mathbf{b}$ from a
neutral dielectric sphere of radius $a$ cannot be solved with images;
however, an integral for the correct potential at radius $r$ can be
obtained using the method of Problem 1.21. The answer for the potential
outside the sphere ($r > a$) is


q 
= +ê 
$ Ir -b| o 
where 
a(e-— 1) 
ôb = -q-——!l 
$ b e+i 
with 
Fa 1 1 4 ; (eye 
(+ 2_9 8 1/2 i/y y E -A š 1/2 
y y cos 0) Yy a+y 2y’ cos 8) 


0

where y = 1 + e and y = a’/br. Show that this answer is correct and
give a similar answer for the potential inside the sphere (r < a). 


Miscellaneous problems 


1.29. Prove that the mean value of the electrostatic potential in vacuum
averaged over the surface of a sphere is equal to the potential at
the center of the sphere, provided there is no charge inside the
sphere.


1.30. Find the equation for the lines of force in the xy plane around an
electric dipole lying along the x-axis. The line of force points in the
direction of the electric field, so its differential equation is

$$\frac{dy}{dx} = \frac{E_y}{E_x}.$$

Hint: Use polar coordinates in the plane to obtain an equation
$r = r_0 f(\theta)$. Sketch a few lines.


1.31. Repeat Problem 1.30 for a quadrupole with a potential


1 — 3cos? 8 
parn 
a 
The answer is straightforward in the first quadrant ($0 \le \theta < \pi/2$).
What happens in the second quadrant ($\pi/2 < \theta \le \pi$)?


1.32. (a) Calculate, from first principles and the definition of electric
dipole moment, the force between two dipoles $\mathbf{p}_1$ and
$\mathbf{p}_2$. Is $\mathbf{F}_{21} = -\mathbf{F}_{12}$?

(b) Calculate the torque $\boldsymbol{\tau}_{12}$ exerted by $\mathbf{p}_1$
on $\mathbf{p}_2$. Is $\boldsymbol{\tau}_{12} = -\boldsymbol{\tau}_{21}$? If
so, fine. If not, explain what happened.


1.33. Prove directly, using a Green's type theorem, that the coefficients
of potential are symmetric: $p_{ij} = p_{ji}$.


1.34. (a) Find the Green's function for a conducting insulated sphere.

(b) Find the Green's function for a conducting sphere held at fixed
potential.

(c) Verify the symmetry $G(\mathbf{r}, \mathbf{r}') = G(\mathbf{r}', \mathbf{r})$
for both cases.


1.35. The simplest electrostatic variational calculation is quite
complicated. Consider, for example, the problem of finding the field
produced by a constant charge density

$$\rho = \rho_0, \quad r < a$$

and

$$\rho = 0, \quad r > a.$$

Try a variational function

$$\psi = \frac{A}{r^n}, \quad r > a,$$

$$\psi = Br^m + C, \quad r < a.$$

From the boundary condition at $r = a$, find $C$ as a function of $A$,
$B$, and $a$. Calculate the variational integral $I$ and minimize with
respect to $A$ and $B$. There remains a dimensionless function of $n$
and $m$. Show that the minimum of $I$ occurs for $n = 1$ and $m = 2$,
the correct values.


1.36. There is a very successful model that takes into account the
interaction of atomic dipoles with each other in calculating the dielectric
constant. This model leads to a formula known as the Clausius-Mossotti
relation. The argument starts by noting [as shown in
(1.6.23)] that the macroscopic (i.e., average) field $\mathbf{E}$ is
generated by external sources and by the average dipole moment per unit
volume $\mathbf{P}$ of the dielectric. Thus,

$$\mathbf{E}(\mathbf{r}) = \mathbf{E}(\text{from external sources}) - \nabla\int d\mathbf{r}'\,\mathbf{P}(\mathbf{r}')\cdot\frac{\mathbf{r} - \mathbf{r}'}{|\mathbf{r} - \mathbf{r}'|^3}.$$

Presumably, the part of the above $d\mathbf{r}'$ integration coming from
large values of $|\mathbf{r} - \mathbf{r}'|$ gives a good approximation to
the contribution of distant dipoles to the local field at an atom. However,
the contribution of nearby dipoles must be explicitly summed. A remarkable
sleight of hand follows: the integral over $d\mathbf{r}'$ coming from a very
small, but macroscopic, sphere surrounding the atom at $\mathbf{r}$ is
subtracted from the above formula and replaced by the sum of the
fields of the point dipoles in that sphere. Thus,

$$\mathbf{E}_{\text{local}}(\mathbf{r}) = \mathbf{E} + \nabla\int_{\text{inside sphere}} d\mathbf{r}'\,\mathbf{P}(\mathbf{r}')\cdot\frac{\mathbf{r} - \mathbf{r}'}{|\mathbf{r} - \mathbf{r}'|^3} - \nabla\sum_{i\ \text{inside sphere}}\mathbf{p}_i\cdot\frac{\mathbf{r} - \mathbf{r}_i}{|\mathbf{r} - \mathbf{r}_i|^3}.$$

Now comes the sleight of hand: For the two extremes of a crystal
with cubic symmetry, on the one hand, and for randomly placed
dipoles, on the other, the sum over $i$ vanishes. Also, for a small
enough sphere, $\mathbf{P}(\mathbf{r})$ will be approximately constant over
the sphere. With these assumptions, show

$$\text{(a)}\quad \mathbf{E}_{\text{local}} = \mathbf{E} + \frac{4\pi}{3}\mathbf{P}$$

and from this

$$\text{(b)}\quad \frac{\epsilon - 1}{\epsilon + 2} = \frac{4\pi}{3}\,n\alpha$$

where $\alpha$ is the atomic polarizability and $n$ the number of atoms per
unit volume. This formula predicts that the measurable quantity

$$\frac{(\epsilon + 2)\,n}{\epsilon - 1}$$

for a given substance should be approximately independent of
external parameters, such as pressure and temperature. Note that
weak coupling between the atoms corresponds to small $n\alpha$, so that
$\epsilon - 1 \cong 4\pi n\alpha$.


*1.37. For an interesting problem employing standard electrostatic
methods, see Problem 2.14 (magnetic levitation of a superconducting
sphere).


CHAPTER 2


Steady Currents and 
Magnetostatics 


2.1. STEADY CURRENTS 


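We describe the flow of currents in a medium by a vector current density
$\mathbf{j}(\mathbf{x})$, where $\mathbf{j}(\mathbf{x})\cdot d\mathbf{S}$ is
the charge crossing the surface element $d\mathbf{S}$ in unit
time, that is, the current through $d\mathbf{S}$. Total charge is conserved.
Thus, with $\rho(\mathbf{x}, t)$ the charge density, the decrease of charge
in a volume $V$ must equal the flow of current through the boundary
surface $S$:

$$-\frac{d}{dt}\int_V d\mathbf{r}\,\rho = \int_S \mathbf{j}\cdot d\mathbf{S}$$

or

$$-\int_V d\mathbf{r}\,\frac{\partial\rho}{\partial t} = \int_V d\mathbf{r}\,\nabla\cdot\mathbf{j}, \tag{2.1.1}$$

for any volume $V$; hence,

$$\frac{\partial\rho}{\partial t} + \nabla\cdot\mathbf{j} = 0. \tag{2.1.2}$$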


Equation (2.1.2) is called the continuity equation. The conservation of
total charge is the global reflection of the local law. [Almost all the known
conserved quantities in physics have local densities that satisfy a local
equation like (2.1.2). We shall see many of them.] For a static situation,
$\partial\rho/\partial t = 0$ and $\nabla\cdot\mathbf{j} = 0$. That is the
case we will be considering in this section.

For a large class of media and sufficiently small electric fields,
macroscopic currents are generated according to Ohm’s law:

$$\mathbf{j} = \sigma\mathbf{E} \tag{2.1.3}$$

or more generally,

$$j_i = \sigma_{ij}E_j. \tag{2.1.4}$$


The tensor $\sigma_{ij}$ is called the conductivity. The rate at which work
is done on currents, producing heat, is

$$\frac{dW}{dt} = \int d\mathbf{r}\,E_i j_i = \int d\mathbf{r}\,E_i\sigma_{ij}E_j, \tag{2.1.5}$$

so that the symmetric part of $\sigma_{ij}$ must be positive. The
antisymmetric part of $\sigma_{ij}$ does not contribute to $dW/dt$. An
example of an antisymmetric conductivity tensor can be found in the Hall
effect, where $\sigma_{ij}$ has an antisymmetric component

$$\sigma_{ij} = \text{constant}\;\epsilon_{ijk}B_k \tag{2.1.6}$$

with $B_k$ the magnetic field. (See Problem 3.5.)
The equations that govern the conducting medium are, with (2.1.4),

$$\nabla\times\mathbf{E} = 0, \tag{2.1.7}$$

$$j_i = \sigma_{ij}E_j,$$

and

$$\nabla\cdot\mathbf{j} = 0, \tag{2.1.8}$$


identical to the equations for a dielectric medium, with the conductivity
replacing the dielectric constant and $\mathbf{j}$ replacing $\mathbf{D}$.
Equations (2.1.3), (2.1.7), and (2.1.8) imply Ohm’s law for the relation
between the potential difference $\Delta\phi$ along a conductor and the
current $I$ flowing through it:

$$\Delta\phi = RI \tag{2.1.9}$$

where $R$ is called the resistance of the conductor. $R$ is simply related to
the conductivity for a cylindrical conductor:

$$R = \frac{L}{\sigma A} \tag{2.1.10}$$

where $L$ is the length and $A$ the cross-sectional area of the cylinder.

A current in conductors can be generated by electromagnetic induc- 
tion (to be discussed later in this section) or by a chemical or thermal 
source of energy (for example a voltaic cell). We give a brief discussion 
of the latter here. 

A voltaic cell is usually an arrangement of two electrodes of different
material immersed in an ionized fluid, or electrolyte, such that it is
energetically profitable to transport positive ions to one electrode (the
positive terminal) and negative ions to the other electrode (the negative
terminal). The electrochemical energy per unit charge transferred is called
the electromotive force $\mathcal{E}$ of the cell. Charge will build up on
the electrodes until the opposing potential difference of the electrodes
$\Delta\phi = \mathcal{E}$. Thus, the open circuit voltage of a cell is equal
to $\mathcal{E}$.

When the external circuit is closed, positive charge will flow in the
external circuit from + to −, and inside the cell from − to +. What
happens to the external potential $\Delta\phi$? At $I = 0$,
$\Delta\phi = \mathcal{E}$. It is reasonable to expand $\Delta\phi$ about
$\mathcal{E}$ in a power series in $I$ and to keep the first two
terms:

$$\Delta\phi = \mathcal{E} - Ir + \cdots. \tag{2.1.11}$$

We call $r$ the internal resistance of the cell. $r$ must be positive since
when current $I$ flows, the electrochemical power produced is
$\mathcal{E}I$. This power cannot be smaller than the power $\Delta\phi\,I$
transformed into heat in the external resistance. Thus,

$$\mathcal{E}I \ge \Delta\phi\,I \tag{2.1.12}$$

or

$$r \ge 0. \tag{2.1.13}$$


For the equality to hold, there would have to be no dissipation in the cell 
itself, and the cell would be a perfect conductor. Normally, r > 0 and 


$$I = \frac{\mathcal{E}}{R + r}. \tag{2.1.14}$$
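The bookkeeping behind (2.1.11) through (2.1.14) can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function name `cell_circuit` and the numerical values are assumptions chosen for the example, and any consistent set of units for voltage and resistance may be used.

```python
def cell_circuit(emf, r_internal, r_load):
    """Return (I, delta_phi, heat in load, heat in cell) for a cell driving a resistor.

    emf        : electromotive force of the cell
    r_internal : internal resistance r of the cell
    r_load     : external resistance R
    """
    current = emf / (r_load + r_internal)            # eq. (2.1.14)
    terminal_voltage = emf - current * r_internal    # first two terms of (2.1.11)
    p_external = terminal_voltage * current          # heat in the external resistance
    p_internal = current ** 2 * r_internal           # heat dissipated inside the cell
    return current, terminal_voltage, p_external, p_internal


# Illustrative (assumed) numbers: emf 1.5, r = 0.5, R = 2.5
current, terminal_voltage, p_external, p_internal = cell_circuit(1.5, 0.5, 2.5)
print(current, terminal_voltage, p_external, p_internal)
```

With these numbers the electrochemical power, emf times current, equals the sum of the two heating rates, and the terminal voltage is below the open-circuit value, as (2.1.12) requires.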
2.2. MAGNETIC FIELDS


Units 


We work in mixed Gaussian units: e.s.u. for electric field and charge, 
e.m.u. for magnetic fields (gauss or oersted). Current is fixed by units of 
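charge and time: $I$ is in e.s.u./sec. Note that 1 Ampere = 1 Coulomb/sec =
$3 \times 10^9$ e.s.u./sec. Magnetic units start with the force between
magnets, treated as dipoles. Thus, as in electrostatics, two unit magnetic
poles situated 1 cm apart exert a force of 1 dyne on each other. Since
magnetic poles do not appear in nature, one calculates (in principle) the
force between two small, widely separated magnetic dipoles:

$$\mathbf{F}_{1\,\text{on}\,2} = (\mathbf{m}_2\cdot\nabla_2)\,\mathbf{B}(\mathbf{r}_2) = (\mathbf{m}_2\cdot\nabla_2)\left(\nabla_2\,\mathbf{m}_1\cdot\nabla_2\frac{1}{r_{12}}\right) \tag{2.2.1}$$

or, with $\mathbf{r}_{12} = \mathbf{r}_2 - \mathbf{r}_1$,

$$\mathbf{F}_{1\,\text{on}\,2} = \frac{3\mathbf{r}_{12}\,\mathbf{m}_1\cdot\mathbf{m}_2}{r_{12}^5} + \frac{3\left[\mathbf{m}_2\,\mathbf{m}_1\cdot\mathbf{r}_{12} + \mathbf{m}_1\,\mathbf{m}_2\cdot\mathbf{r}_{12}\right]}{r_{12}^5} - \frac{15\,\mathbf{r}_{12}\,\mathbf{m}_2\cdot\mathbf{r}_{12}\,\mathbf{m}_1\cdot\mathbf{r}_{12}}{r_{12}^7}. \tag{2.2.2}$$

With $F$ in dynes and $r_{12}$ in centimeters, (2.2.2) determines magnetic
moment in e.m.u. The magnetic field $\mathbf{B}$ produced by a moment
$\mathbf{m}$ is then given in gauss by

$$\mathbf{B} = -\nabla\frac{\mathbf{m}\cdot\mathbf{r}}{r^3} = -\frac{\mathbf{m}}{r^3} + \frac{3\mathbf{r}(\mathbf{m}\cdot\mathbf{r})}{r^5}. \tag{2.2.3}$$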
The magnetostatics of permanent magnets had been clarified by the 
end of the eighteenth century. However, there was at that time no known 
connection between magnetic and electric phenomena. Then, in 1820, 
Oersted discovered the magnetic field surrounding a wire carrying a steady 
current. Within 12 years, Ampère, Biot, Savart, Faraday, and others had
worked out the physics of magnetic fields and steady currents, culminating 
in Faraday’s discovery of electromagnetic induction. 

In working through this subject, we start from the discovery by Biot
and Savart that the magnetic field at a point $\mathbf{r}$ due to a circuit
carrying a current $I$ could be calculated from the formula

$$\mathbf{B}(\mathbf{r}) = \frac{I}{c}\oint d\mathbf{l}'\times\frac{\mathbf{r} - \mathbf{r}'}{|\mathbf{r} - \mathbf{r}'|^3}, \tag{2.2.4}$$

where $\oint d\mathbf{l}'$ signifies a line integral around the circuit and
$\mathbf{r}'$ is the position vector of $d\mathbf{l}'$. The constant $c$ has
the dimension of velocity, $c = 3 \times 10^{10}$ cm/sec.
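As a numerical illustration of (2.2.4), the sketch below sums the Biot-Savart contributions of a circular loop, discretized into short straight elements, and compares the on-axis result with the standard closed form $B_z = 2\pi I a^2 / [c\,(a^2 + z^2)^{3/2}]$. The function names, the segment count, and the sample values (a 1 cm loop carrying 1 ampere, taken as $3 \times 10^9$ e.s.u./sec) are assumptions made for this example only.

```python
import math

C_LIGHT = 2.998e10  # speed of light in cm/sec (Gaussian units)


def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def loop_field(point, radius, current, n_segments=2000):
    """Evaluate (2.2.4) for a circular loop in the xy-plane centred on the origin.

    point is the field point in cm, current is in e.s.u./sec; returns B in gauss.
    """
    bx = by = bz = 0.0
    dphi = 2.0 * math.pi / n_segments
    for k in range(n_segments):
        phi = (k + 0.5) * dphi  # midpoint of the k-th arc element
        src = (radius * math.cos(phi), radius * math.sin(phi), 0.0)
        dl = (-radius * math.sin(phi) * dphi, radius * math.cos(phi) * dphi, 0.0)
        sep = (point[0] - src[0], point[1] - src[1], point[2] - src[2])  # r - r'
        dist3 = (sep[0] ** 2 + sep[1] ** 2 + sep[2] ** 2) ** 1.5
        d_b = cross(dl, sep)
        bx += d_b[0] / dist3
        by += d_b[1] / dist3
        bz += d_b[2] / dist3
    scale = current / C_LIGHT
    return (scale * bx, scale * by, scale * bz)


a, z, current = 1.0, 0.5, 3.0e9  # loop radius (cm), height on axis (cm), 1 ampere
B = loop_field((0.0, 0.0, z), a, current)
B_exact = 2.0 * math.pi * current * a ** 2 / (C_LIGHT * (a ** 2 + z ** 2) ** 1.5)
print(B[2], B_exact)
```

On the axis only the component along the axis survives, and the discretized sum agrees with the closed form to within rounding error.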

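Equation (2.2.4) contains a strong hint as to the force of a magnetic
field on a circuit. Let us use (2.2.4) to calculate the force of
$\mathbf{B}$ on a hypothetical magnetic pole $p$. It is

$$\mathbf{F}_{C\,\text{on}\,p} = p\mathbf{B} = -\frac{I}{c}\oint d\mathbf{l}'\times\frac{p(\mathbf{r}' - \mathbf{r})}{|\mathbf{r}' - \mathbf{r}|^3}. \tag{2.2.5}$$

We recognize

$$\frac{p(\mathbf{r}' - \mathbf{r})}{|\mathbf{r}' - \mathbf{r}|^3} = \mathbf{B}_p(\mathbf{r}'), \tag{2.2.6}$$

where $\mathbf{B}_p(\mathbf{r}')$ is the magnetic field that would be
produced at $\mathbf{r}'$ by the hypothetical pole at $\mathbf{r}$. Thus,

$$\mathbf{F}_{C\,\text{on}\,p} = -\frac{I}{c}\oint d\mathbf{l}'\times\mathbf{B}_p(\mathbf{r}'), \tag{2.2.7}$$

and if we assume that action is equal and opposite to reaction (a
treacherous assumption here, as we shall see later, but correct for circuits
carrying steady currents), we find that the force on a circuit is suggested
to be

$$\mathbf{F} = \frac{I}{c}\oint d\mathbf{l}'\times\mathbf{B}(\mathbf{r}'). \tag{2.2.8}$$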


The artificiality of the use of poles could be avoided: we could arrive at
the same result (2.2.8) by considering a real dipole instead of a
hypothetical pole. More important, the formula (2.2.8) is experimentally
correct. Thus, one can calculate the magnetic field due to a current in a
wire as if it were due to a sum of contributions from each circuit element
$d\mathbf{l}'$, with

$$d\mathbf{B} = \frac{I}{c}\,d\mathbf{l}'\times\frac{\mathbf{r} - \mathbf{r}'}{|\mathbf{r} - \mathbf{r}'|^3}, \tag{2.2.9}$$

and the force on a circuit as if it were the sum of forces on each circuit
element $d\mathbf{l}'$ of

$$d\mathbf{F} = \frac{I}{c}\,d\mathbf{l}'\times\mathbf{B}. \tag{2.2.10}$$


We might try to guess the field of a single moving charge from (2.2.9)
and the magnetic force on a single moving charge from (2.2.10). To do
this, we note that with $n$ charges per unit volume, each of magnitude $q$
and velocity $v$,

$$I = nqvA \tag{2.2.11}$$

where $A$ is the cross-sectional area of the wire. Multiplying by
$d\mathbf{l}/dl$ converts $v$ to $\mathbf{v}$, and multiplying $A$ by $dl$
produces the volume of the current element $d\mathbf{l}$. Thus,
$I\,d\mathbf{l} = Nq\mathbf{v}$, where $N$ is the number of charges
in $d\mathbf{l}$. With one charge, we would guess from (2.2.9)

$$\mathbf{B} = q\,\frac{\mathbf{v}}{c}\times\frac{\mathbf{r} - \mathbf{r}'}{|\mathbf{r} - \mathbf{r}'|^3} \tag{2.2.12}$$

and from (2.2.10)

$$\mathbf{F}_q = q\,\frac{\mathbf{v}}{c}\times\mathbf{B}, \tag{2.2.13}$$

the Lorentz force law.

Remarkably, it turns out that (2.2.13) is exactly right (if $\mathbf{F}$ is
properly interpreted) and (2.2.12) is approximately right for low
frequencies and velocities $\ll c$.

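One disconcerting discovery is that (2.2.12) and (2.2.13) do not satisfy
Newton’s third law. Thus,

$$\mathbf{F}_{2\,\text{on}\,1} = q_1\frac{\mathbf{v}_1}{c}\times\mathbf{B}_2(\mathbf{r}_1) = q_1\frac{\mathbf{v}_1}{c}\times\left[q_2\frac{\mathbf{v}_2}{c}\times\frac{\mathbf{r}_1 - \mathbf{r}_2}{|\mathbf{r}_1 - \mathbf{r}_2|^3}\right] \tag{2.2.14}$$

or

$$\mathbf{F}_{2\,\text{on}\,1} = \frac{q_1q_2}{c^2r_{12}^3}\,\mathbf{v}_1\times(\mathbf{v}_2\times(\mathbf{r}_1 - \mathbf{r}_2)) \tag{2.2.15}$$

and

$$\mathbf{F}_{1\,\text{on}\,2} = \frac{q_1q_2}{c^2r_{12}^3}\,\mathbf{v}_2\times(\mathbf{v}_1\times(\mathbf{r}_2 - \mathbf{r}_1)). \tag{2.2.16}$$

Since, writing $\mathbf{r} = \mathbf{r}_1 - \mathbf{r}_2$,

$$\mathbf{v}_1\times(\mathbf{v}_2\times\mathbf{r}) = \mathbf{v}_1\cdot\mathbf{r}\,\mathbf{v}_2 - \mathbf{v}_1\cdot\mathbf{v}_2\,\mathbf{r} \tag{2.2.17}$$

and

$$\mathbf{v}_2\times(\mathbf{v}_1\times(-\mathbf{r})) = -\mathbf{v}_2\cdot\mathbf{r}\,\mathbf{v}_1 + \mathbf{v}_1\cdot\mathbf{v}_2\,\mathbf{r}, \tag{2.2.18}$$

the two expressions are only opposite when

$$\mathbf{v}_1\cdot\mathbf{r}\,\mathbf{v}_2 - \mathbf{v}_2\cdot\mathbf{r}\,\mathbf{v}_1 = 0 \quad\text{or}\quad \mathbf{r}\times(\mathbf{v}_1\times\mathbf{v}_2) = 0,$$

that is, either $\mathbf{v}_2 \parallel \mathbf{v}_1$, or $\mathbf{r}$ is
perpendicular to the plane of $\mathbf{v}_1$ and $\mathbf{v}_2$. We shall see
later that in spite of this inequality of action and reaction, momentum can
still be defined for electromagnetic systems and is conserved; however, one
must add to the particle momentum a field momentum that gives overall
balance.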
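The failure of action and reaction in (2.2.15) and (2.2.16) is easy to exhibit with explicit vectors. The following sketch is illustrative only: the helper `magnetic_force`, the choice of unit charges, and the use of units in which $c = 1$ are assumptions made to keep the numbers simple.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def magnetic_force(q_a, v_a, r_a, q_b, v_b, r_b, c=1.0):
    """Force of charge b on charge a in the form of (2.2.15):
    (q_a q_b / c^2 r^3) v_a x (v_b x (r_a - r_b)).
    """
    sep = tuple(x - y for x, y in zip(r_a, r_b))
    dist3 = sum(s * s for s in sep) ** 1.5
    triple = cross(v_a, cross(v_b, sep))
    k = q_a * q_b / (c * c * dist3)
    return tuple(k * t for t in triple)


r1, r2 = (1.0, 0.0, 0.0), (0.0, 0.0, 0.0)

# Perpendicular velocities: v1 along x (along the separation), v2 along y.
f_2on1 = magnetic_force(1.0, (1.0, 0.0, 0.0), r1, 1.0, (0.0, 1.0, 0.0), r2)
f_1on2 = magnetic_force(1.0, (0.0, 1.0, 0.0), r2, 1.0, (1.0, 0.0, 0.0), r1)
print("perpendicular:", f_2on1, f_1on2)

# Parallel velocities: both along y, perpendicular to the separation.
g_2on1 = magnetic_force(1.0, (0.0, 1.0, 0.0), r1, 1.0, (0.0, 1.0, 0.0), r2)
g_1on2 = magnetic_force(1.0, (0.0, 1.0, 0.0), r2, 1.0, (0.0, 1.0, 0.0), r1)
print("parallel:", g_2on1, g_1on2)
```

In the perpendicular case one force vanishes while the other does not, so the two cannot be equal and opposite. In the parallel case the forces are equal, opposite, and attractive, consistent with the condition stated above.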

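We should, however, check for action and reaction in the case of
circuits carrying steady currents. Thus, for the force of circuit 2 on circuit
1, we have

$$\mathbf{F}_{2\,\text{on}\,1} = \frac{I_1}{c}\oint_{C_1} d\mathbf{l}_1\times\mathbf{B}_2(\mathbf{r}_1) = \frac{I_1I_2}{c^2}\oint_{C_1}\oint_{C_2} d\mathbf{l}_1\times\left(d\mathbf{l}_2\times\frac{\mathbf{r}_1 - \mathbf{r}_2}{r_{12}^3}\right) \tag{2.2.19}$$

or

$$\mathbf{F}_{2\,\text{on}\,1} = \frac{I_1I_2}{c^2}\left[\oint_{C_2}\oint_{C_1} d\mathbf{l}_2\,\frac{d\mathbf{l}_1\cdot(\mathbf{r}_1 - \mathbf{r}_2)}{|\mathbf{r}_1 - \mathbf{r}_2|^3} - \oint_{C_1}\oint_{C_2}\frac{d\mathbf{l}_1\cdot d\mathbf{l}_2}{r_{12}^3}\,(\mathbf{r}_1 - \mathbf{r}_2)\right]. \tag{2.2.20}$$

The first integral is zero, since
$(\mathbf{r}_1 - \mathbf{r}_2)/r_{12}^3 = -\nabla_1(1/r_{12})$ and
$\oint_{C_1} d\mathbf{l}_1\cdot\nabla_1(1/r_{12}) = 0$ for every
$\mathbf{r}_2$. Thus,

$$\mathbf{F}_{2\,\text{on}\,1} = -\frac{I_1I_2}{c^2}\oint_{C_1}\oint_{C_2}\frac{d\mathbf{l}_1\cdot d\mathbf{l}_2}{r_{12}^3}\,(\mathbf{r}_1 - \mathbf{r}_2), \tag{2.2.21}$$

which evidently satisfies the law of action and reaction. We can also read
off the sign of the force for $I_1$ and $I_2$ in the same sense: It is
attractive.

We turn now to different ways of expressing the basic formula (2.2.4).
As we pointed out earlier, $I\,d\mathbf{l} = nq\mathbf{v}\,d^3r$, or

$$I\,d\mathbf{l} = \rho\mathbf{v}\,d^3r = \mathbf{j}\,d^3r \tag{2.2.22}$$

where $d^3r$ is the volume element of the wire, $\rho$ the charge density of
the moving charge, $\mathbf{v}$ its velocity, and $\mathbf{j}$ the current
density. More directly,

$$I\,d\mathbf{l} = jA\,d\mathbf{l} = \mathbf{j}\,d^3r. \tag{2.2.23}$$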


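Therefore, a continuous distribution of current (imagine many wires lined
up together) generates a field

$$\mathbf{B}(\mathbf{r}) = \int d\mathbf{r}'\,\frac{\mathbf{j}(\mathbf{r}')}{c}\times\frac{\mathbf{r} - \mathbf{r}'}{|\mathbf{r} - \mathbf{r}'|^3} \tag{2.2.24}$$

and a field $\mathbf{B}$ exerts a volume force on a current $\mathbf{j}$ of

$$\mathbf{f} = \frac{\mathbf{j}}{c}\times\mathbf{B}. \tag{2.2.25}$$

“Volume force” $\mathbf{f}$ means that the actual force on an element of
current occupying a volume $V$ is

$$\mathbf{F} = \int_V d\mathbf{r}\,\mathbf{f}. \tag{2.2.26}$$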


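The formulae we have derived are sufficient to calculate the magnetic
field $\mathbf{B}$ of a given steady current distribution. Since the current
is steady, we must have, in the absence of an indefinite piling up of charge
density, $\nabla\cdot\mathbf{j} = 0$.

We note that (2.2.24) is

$$\mathbf{B}(\mathbf{r}) = -\int d\mathbf{r}'\,\frac{\mathbf{j}(\mathbf{r}')}{c}\times\nabla\frac{1}{|\mathbf{r} - \mathbf{r}'|} \tag{2.2.27}$$

or

$$\mathbf{B}(\mathbf{r}) = \nabla\times\int d\mathbf{r}'\,\frac{\mathbf{j}(\mathbf{r}')}{c\,|\mathbf{r} - \mathbf{r}'|}. \tag{2.2.28}$$

From (2.2.28) we can derive the differential equations satisfied by
$\mathbf{B}$ [analogous to (1.1.17) and (1.1.36) for the electric field].
First,

$$\nabla\cdot\mathbf{B} = 0. \tag{2.2.29}$$

Second,

$$\nabla\times\mathbf{B} = \frac{1}{c}\nabla\left(\nabla\cdot\int d\mathbf{r}'\,\frac{\mathbf{j}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\right) - \frac{1}{c}\nabla^2\int d\mathbf{r}'\,\frac{\mathbf{j}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}. \tag{2.2.30}$$

The first term vanishes, since $\nabla\cdot\mathbf{j} = 0$. To see this,
note that

$$\nabla\cdot\int d\mathbf{r}'\,\frac{\mathbf{j}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|} = \int d\mathbf{r}'\,\nabla\frac{1}{|\mathbf{r} - \mathbf{r}'|}\cdot\mathbf{j}(\mathbf{r}') = -\int d\mathbf{r}'\,\nabla'\frac{1}{|\mathbf{r} - \mathbf{r}'|}\cdot\mathbf{j}(\mathbf{r}') = \int d\mathbf{r}'\,\frac{1}{|\mathbf{r} - \mathbf{r}'|}\,\nabla'\cdot\mathbf{j}(\mathbf{r}') = 0.$$

The second term in (2.2.30) is $(4\pi/c)\,\mathbf{j}(\mathbf{r})$. The source
equation for $\mathbf{B}$ is therefore

$$\nabla\times\mathbf{B} = \frac{4\pi}{c}\,\mathbf{j}. \tag{2.2.31}$$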


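Ampère’s circuital law follows directly from (2.2.31) by integration over
any open surface $S$ bounded by a curve $C$:

$$\int_S (\nabla\times\mathbf{B})\cdot d\mathbf{S} = \frac{4\pi}{c}\int_S d\mathbf{S}\cdot\mathbf{j} \tag{2.2.32}$$

or by Stokes’ theorem,

$$\oint_C \mathbf{B}\cdot d\mathbf{l} = \frac{4\pi}{c}\,I_{\text{enclosed}}, \tag{2.2.33}$$

where $I_{\text{enclosed}}$ is the current flowing through the surface $S$.

Finally, (2.2.28) invites us to define a vector potential

$$\mathbf{A} = \int d\mathbf{r}'\,\frac{\mathbf{j}(\mathbf{r}')}{c\,|\mathbf{r} - \mathbf{r}'|} \tag{2.2.34}$$

with

$$\mathbf{B} = \nabla\times\mathbf{A}. \tag{2.2.35}$$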


Note that (2.2.29) implies the possibility of introducing a vector potential,
for which (2.2.34) supplied an explicit formula.

The notion of gauge invariance makes its first appearance here. We
can always add the gradient of a scalar $\nabla\psi$ to any $\mathbf{A}$
without changing the field $\mathbf{B}$, since
$\nabla\times(\nabla\psi) = 0$. The choice of $\mathbf{A}$ among all these
possibilities is called the choice of gauge. The transformation

$$\mathbf{A} \to \mathbf{A} + \nabla\psi \tag{2.2.36}$$

is called a gauge transformation, and $\mathbf{B}$ is said to be gauge
invariant. The gauge choice in (2.2.34) is evidently
$\nabla\cdot\mathbf{A} = 0$. Note that (2.2.36) informs us that we can always
find a $\psi$ to make $\nabla\cdot\mathbf{A} = 0$. Suppose
$\nabla\cdot\mathbf{A}_0 \ne 0$. Then let
$\mathbf{A}_1 = \mathbf{A}_0 + \nabla\psi$. We can make
$\nabla\cdot\mathbf{A}_1 = \nabla\cdot\mathbf{A}_0 + \nabla^2\psi = 0$ by
solving the equation $\nabla^2\psi = -\nabla\cdot\mathbf{A}_0$, which is
always possible.

The underlying theories that physicists work with today are all theories
with gauge invariance under transformations similar to (2.2.36). Not
surprisingly, they are called gauge theories. In particular, the so-called
standard model of the strong, weak, and electromagnetic interactions has
all interactions mediated by gauge fields: eight colored gluon fields for the
strong interactions, the $W^\pm$ and $Z^0$ fields for the weak interactions,
and of course the electromagnetic field.


2.3. MAGNETIC MULTIPOLES 


Starting from (2.2.34) for the vector potential

$$\mathbf{A}(\mathbf{r}) = \frac{1}{c}\int d\mathbf{r}'\,\frac{\mathbf{j}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|} \tag{2.2.34}$$

with $\nabla\cdot\mathbf{j} = 0$, we can expand $\mathbf{A}(\mathbf{r})$ in a
power series in $r'/r$ (convergent for $r' < r$):

$$\mathbf{A}(\mathbf{r}) = \frac{1}{c}\sum_{l=0}^{\infty}\frac{(-1)^l}{l!}\int d\mathbf{r}'\,\mathbf{j}(\mathbf{r}')\,(\mathbf{r}'\cdot\nabla)^l\,\frac{1}{r}. \tag{2.3.1}$$

We consider first the terms $l = 0$ and $l = 1$, and then go on to the
general term. First, $l = 0$:

$$\mathbf{A}^{(0)}(\mathbf{r}) = \frac{1}{cr}\int d\mathbf{r}'\,\mathbf{j}(\mathbf{r}'). \tag{2.3.2}$$

$\mathbf{A}^{(0)}(\mathbf{r})$ can be seen to be zero from the identity

$$0 = \int d\mathbf{r}\,x_i\frac{\partial j_k}{\partial x_k} = -\int d\mathbf{r}\,\frac{\partial x_i}{\partial x_k}\,j_k = -\int d\mathbf{r}\,j_i(\mathbf{r}). \tag{2.3.3}$$

Thus, there is no vector potential of order $1/r$ for large $r$ and, hence,
no magnetic field of order $1/r^2$. This conclusion has nothing to do with
the nonexistence of magnetic poles. If magnetic poles existed, there would,
of course, be magnetostatic fields going like $1/r^2$, where $r$ would be
the distance from the pole. There still would be no $1/r^2$ field generated
by steady electric currents. Note that a single moving point charge does not
constitute a steady current and, hence, will give rise to a $1/r^2$ field.
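The term $l = 1$:

$$\mathbf{A}^{(1)}(\mathbf{r}) = -\frac{1}{c}\int d\mathbf{r}'\,\mathbf{j}(\mathbf{r}')\,\mathbf{r}'\cdot\nabla\frac{1}{r}. \tag{2.3.4}$$

It is useful here to change to tensor notation:

$$A_i^{(1)}(\mathbf{r}) = -\frac{1}{c}\int d\mathbf{r}'\,j_i(\mathbf{r}')\,x_k'\,\frac{\partial}{\partial x_k}\frac{1}{r}. \tag{2.3.5}$$

We proceed by decomposing the term $j_i x_k'$ into symmetric and
antisymmetric parts:

$$j_i x_k' = \tfrac{1}{2}(j_i x_k' + j_k x_i') + \tfrac{1}{2}(j_i x_k' - j_k x_i'). \tag{2.3.6}$$

The first term in (2.3.6) integrates to zero, as in (2.3.3):

$$\int d\mathbf{r}'\,(j_i x_k' + j_k x_i') = -\int d\mathbf{r}'\,x_i' x_k'\,\frac{\partial j_l}{\partial x_l'} = 0. \tag{2.3.7}$$

The second term in (2.3.6), inserted into (2.3.5), gives

$$A_i^{(1)}(\mathbf{r}) = -\frac{1}{c}\int d\mathbf{r}'\,\frac{j_i(\mathbf{r}')\,x_k' - j_k(\mathbf{r}')\,x_i'}{2}\,\frac{\partial}{\partial x_k}\frac{1}{r}. \tag{2.3.8}$$

We define the magnetic moment density $\mathbf{M}$ by

$$\mathbf{M}(\mathbf{r}') = \frac{1}{2c}\,\mathbf{r}'\times\mathbf{j}(\mathbf{r}') \tag{2.3.9}$$

so that, since

$$\tfrac{1}{2}(\mathbf{r}'\times\mathbf{j})\times\nabla = \tfrac{1}{2}\left[-\mathbf{r}'\,\mathbf{j}\cdot\nabla + \mathbf{j}\,\mathbf{r}'\cdot\nabla\right],$$

$$\mathbf{A}^{(1)}(\mathbf{r}) = -\int d\mathbf{r}'\,\mathbf{M}(\mathbf{r}')\times\nabla\frac{1}{r} \tag{2.3.10}$$

$$= -\mathbf{m}\times\nabla\frac{1}{r} \tag{2.3.11}$$

where $\mathbf{m}$ is the magnetic moment,

$$\mathbf{m} = \int \mathbf{M}(\mathbf{r}')\,d\mathbf{r}'. \tag{2.3.12}$$

The magnetic field of the moment $\mathbf{m}$ is

$$\mathbf{B}^{(1)} = \nabla\times\mathbf{A}^{(1)} = -\nabla\times\left(\mathbf{m}\times\nabla\frac{1}{r}\right) \tag{2.3.13}$$

or, since $\nabla^2(1/r) = 0$ for $r \ne 0$,

$$\mathbf{B}^{(1)} = (\mathbf{m}\cdot\nabla)\,\nabla\frac{1}{r} = \nabla\,(\mathbf{m}\cdot\nabla)\frac{1}{r} = -\nabla\frac{\mathbf{m}\cdot\mathbf{r}}{r^3} \tag{2.3.14}$$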

corresponding to a magnetic “potential”

$$\phi^*(\mathbf{r}) = \frac{\mathbf{m}\cdot\mathbf{r}}{r^3}. \tag{2.3.15}$$

In general, of course, one cannot express $\mathbf{B}$ as the gradient of a
potential, since $\nabla\times\mathbf{B} \ne 0$. However, in a region where
$\nabla\times\mathbf{B} = 0$, that is, outside of a current-carrying region,
one can define a potential $\phi^*$, called the magnetic pseudopotential,
such that

$$\mathbf{B} = -\nabla\phi^*. \tag{2.3.16}$$
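The relation between the pseudopotential (2.3.15) and the closed-form dipole field (2.2.3) can be checked numerically. In the sketch below, the function names, the finite-difference step, and the sample point are assumptions chosen for illustration; the gradient is estimated by central differences rather than computed analytically.

```python
import math


def pseudopotential(m, r):
    """phi*(r) = m.r / r^3, eq. (2.3.15)."""
    dist = math.sqrt(sum(x * x for x in r))
    return sum(mi * xi for mi, xi in zip(m, r)) / dist ** 3


def dipole_field(m, r):
    """Closed form B = -m/r^3 + 3 r (m.r)/r^5, eq. (2.2.3)."""
    dist = math.sqrt(sum(x * x for x in r))
    m_dot_r = sum(mi * xi for mi, xi in zip(m, r))
    return tuple(-mi / dist ** 3 + 3.0 * xi * m_dot_r / dist ** 5
                 for mi, xi in zip(m, r))


def minus_gradient(f, r, h=1e-5):
    """Central-difference estimate of -grad f at the point r."""
    result = []
    for i in range(3):
        plus, minus = list(r), list(r)
        plus[i] += h
        minus[i] -= h
        result.append(-(f(plus) - f(minus)) / (2.0 * h))
    return tuple(result)


m = (0.0, 0.0, 1.0)          # unit moment along z (e.m.u.)
r = (0.3, -0.7, 1.1)         # an arbitrary field point away from the origin (cm)
B_closed = dipole_field(m, r)
B_numeric = minus_gradient(lambda x: pseudopotential(m, x), r)
print(B_closed)
print(B_numeric)
```

The two results agree to the accuracy of the finite-difference step. On the axis of the dipole the field is $2m/r^3$, and in the equatorial plane it is $-m/r^3$, as (2.2.3) gives directly.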


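We discuss the magnetic dipole further by considering two special
cases:


1. A circuit carrying current $I$:

$$\mathbf{m} = \frac{1}{2c}\int d\mathbf{r}\,\mathbf{r}\times\mathbf{j} = \frac{I}{2c}\oint_C \mathbf{r}\times d\mathbf{l} \tag{2.3.17}$$

where $d\mathbf{l}$ is in the direction of the current, or by Stokes’ theorem,

$$\mathbf{m} = \frac{I\mathbf{A}}{c}, \tag{2.3.18}$$

where $\mathbf{A}$ is the “area” of any surface bounded by the circuit $C$:

$$\mathbf{A} = \int d\mathbf{S},$$

with the direction of $d\mathbf{S}$ determined by the right-hand rule applied
to the circuit $C$.


2. A point particle of charge $q$ and velocity $\mathbf{v}_p$ in orbit,
radius vector $\mathbf{r}_p$:

$$\mathbf{m} = \frac{1}{2c}\int d\mathbf{r}\,\mathbf{r}\times\mathbf{j}(\mathbf{r}) = \frac{1}{2c}\int d\mathbf{r}\,q\,\delta(\mathbf{r} - \mathbf{r}_p)\,\mathbf{r}\times\mathbf{v}_p$$

or

$$\mathbf{m} = \frac{q\,\mathbf{r}_p\times\mathbf{v}_p}{2c} = \frac{q\,\mathbf{l}_p}{2mc}, \tag{2.3.19}$$

where $\mathbf{l}_p$ is the angular momentum of the particle. The factor
$q/2mc$ is called the gyromagnetic ratio of the particle. Note that in this
example $\mathbf{j}(\mathbf{r}, t) = q\mathbf{v}_p\,\delta(\mathbf{r} - \mathbf{r}_p(t))$
is not a constant. Nevertheless, the formula for the magnetic moment turns
out to be correct if one first time-averages $\mathbf{m}$ over the fast
orbital motion.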
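The equivalence of the line-integral form (2.3.17) and the area form (2.3.18) can be illustrated for a planar loop. The sketch below approximates an elliptical circuit by a closed polygon and evaluates $(I/2c)\oint \mathbf{r}\times d\mathbf{l}$ directly; the helper names, the ellipse dimensions, the number of vertices, and the shifted origin are all assumptions chosen for the example.

```python
import math

C_LIGHT = 2.998e10  # speed of light in cm/sec (Gaussian units)


def loop_moment(points, current, c=C_LIGHT):
    """m = (I/2c) * sum over the closed polygon of r x dl, following (2.3.17)."""
    mx = my = mz = 0.0
    n = len(points)
    for k in range(n):
        r = points[k]
        nxt = points[(k + 1) % n]  # wrap around to close the circuit
        dl = (nxt[0] - r[0], nxt[1] - r[1], nxt[2] - r[2])
        mx += r[1] * dl[2] - r[2] * dl[1]
        my += r[2] * dl[0] - r[0] * dl[2]
        mz += r[0] * dl[1] - r[1] * dl[0]
    scale = current / (2.0 * c)
    return (scale * mx, scale * my, scale * mz)


def ellipse(a, b, n, centre=(0.0, 0.0, 0.0)):
    """Vertices of an ellipse in a plane of constant z, traversed counterclockwise."""
    return [(centre[0] + a * math.cos(2.0 * math.pi * k / n),
             centre[1] + b * math.sin(2.0 * math.pi * k / n),
             centre[2]) for k in range(n)]


current, a, b = 3.0e9, 2.0, 1.0   # 1 ampere in e.s.u./sec, semi-axes in cm
m_centred = loop_moment(ellipse(a, b, 4000), current)
m_shifted = loop_moment(ellipse(a, b, 4000, centre=(5.0, -3.0, 1.0)), current)
m_area = current * math.pi * a * b / C_LIGHT   # I A / c, eq. (2.3.18)
print(m_centred[2], m_shifted[2], m_area)
```

For a closed circuit the result does not depend on the choice of origin, which is why the shifted loop gives the same moment. The moment points along the normal fixed by the right-hand rule, here the positive z direction for counterclockwise traversal.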

The general term, (2.3.1), can be rewritten in tensor form: 


ð a1 
[ar ieai aes eS, (2.3.20) 
OX, OX; r 


Ayn = Pt 
; 


i 
The above expression for A (r) permits the construction of a pseudo- 


potential that we give without proof (but see Problem 2.12 for the case 
1 = 2): 


2l _,M(’)-r 
(pe) = — —1) | dr' (r Vy DS "2.3.21 
POR oy | weer. 232) 
Recall the analogous electrostatic formula 
_1)/ 
o'r) = CO | ae p(t’) ae yyt, (1.3.1) 
! r 


which showed that the potential of an arbitrary charge distribution can be 
written as a sum of multipoles of order /, each multipole itself being an 
integral over r’ of Maxwell multipoles with the tensor structure 


electric _ Ay Ar 
Chey SLi. Vi, 


and magnitude 


AN 
Qir’) = dr r'p(r’) =, (2.3.22) 


Similarly, φ*(r) is given as a sum of multipoles of order l, each
multipole itself being an integral over r′ of a Maxwell multipole of the
tensor structure


One: = Pi, co. Fi, Mi (2.3.23) 
and magnitude 


NeW i 


pmagnetic = 
Qi (+1)! 


M(r'}. (2.3.24) 


As usual, only the symmetric, trace-free part of $Q^{\mathrm{magnetic}}$ contributes to
the integral (2.3.21) for φ*. Since there is no magnetic monopole, the
Maxwell multipoles can be thought of as coming from successive
displacement of a dipole, rather than a charge.


2.4. MAGNETIC FIELDS IN MATTER 


We deal only briefly here with the averaging process since the essential
issues are very similar to the electrostatic case. As in that case, we define
an average field

$$\mathbf{B}(\mathbf{r}) = \int d\mathbf{r}'\, f(\mathbf{r}-\mathbf{r}')\,\mathbf{b}(\mathbf{r}'), \qquad (2.4.1)$$


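where b is the microscopic field. The differential equations for b,

$$\nabla\times\mathbf{b} = \frac{4\pi}{c}\,\mathbf{j}_m \qquad (2.2.31)$$

and

$$\nabla\cdot\mathbf{b} = 0, \qquad (2.2.29)$$

become, on averaging,

$$\nabla\cdot\mathbf{B} = 0 \qquad (2.4.2)$$

and

$$\nabla\times\mathbf{B} = \frac{4\pi}{c}\,\bar{\mathbf{j}}_m. \qquad (2.4.3)$$

It remains to calculate $\bar{\mathbf{j}}_m$. An applied magnetic field will induce a
magnetic dipole density, the dipole moment per unit volume M. The
vector potential due to M will be given by (2.3.10):

$$\mathbf{A}_M(\mathbf{r}) = -\int d\mathbf{r}'\,\nabla'\frac{1}{|\mathbf{r}-\mathbf{r}'|}\times\mathbf{M}(\mathbf{r}') \qquad (2.4.4)$$

$$\phantom{\mathbf{A}_M(\mathbf{r})} = \int d\mathbf{r}'\,\frac{1}{|\mathbf{r}-\mathbf{r}'|}\,\nabla'\times\mathbf{M}(\mathbf{r}'). \qquad (2.4.5)$$

Evidently, ∇′ × M(r′) plays the same role with respect to j/c as −∇ · P
does with respect to $\rho_b$. In integrating (2.4.4) by parts to arrive at (2.4.5),
we have as usual dropped surface terms, understanding that they will
emerge from the behavior of ∇ × M at an (approximate) discontinuity.
Thus, we see that

$$\bar{\mathbf{j}}_m = c\,\nabla\times\mathbf{M} + \mathbf{j}_f, \qquad (2.4.6)$$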


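where $\mathbf{j}_f$ is the conduction current of moving charges. Calling $\mathbf{j}_f = \mathbf{j}$, we
have from (2.4.2), (2.4.3), and (2.4.6), the field equations

$$\nabla\cdot\mathbf{B} = 0 \qquad (2.4.2)$$

and

$$\nabla\times\mathbf{B} = \frac{4\pi}{c}\,\mathbf{j} + 4\pi\,\nabla\times\mathbf{M}. \qquad (2.4.7)$$

Analogous to our definition of D in electrostatics, we define

$$\mathbf{H} = \mathbf{B} - 4\pi\mathbf{M}, \qquad (2.4.8)$$

leading to the final form for (2.4.7):

$$\nabla\times\mathbf{H} = \frac{4\pi\,\mathbf{j}}{c}. \qquad (2.4.9)$$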


Together with the relation between B and H, (2.4.9) and (2.4.2) determine 
the magnetic field. 

A word on nomenclature: Before the electrical origin of magnetic 
fields was known, the electrical analogue to B seemed to be D, and the 
analogue to H seemed to be E, the differences in the right-hand sides 
reflecting the absence of magnetic poles and currents. Consequently, the 
historic name given to the vector H is magnetic field, that given to B is 
magnetic induction, one conventionally measured in oersteds, the other 
in gauss. We do not differentiate these units from each other. 

The boundary conditions on B and H at a material discontinuity follow
as usual from (2.4.2) and (2.4.9). A Gaussian pillbox applied to (2.4.2)
tells us that $B_{\mathrm{normal}}$ is continuous; a Stokesian rectangle applied to (2.4.9)
tells us that

$$\Delta H_{\mathrm{tangential}} = \frac{4\pi}{c}\,K,$$

where K is the surface current, that is, the current going through the
infinitesimal Stokesian rectangle per unit length along the tangent under
consideration. A surface current requires an infinite current density and
hence infinite conductivity. For finite conductivity, we shall see that the
“surface” current is finite near the surface, and hence there is no
$\Delta H_{\mathrm{tangential}}$, even for a sharply bounded surface.

If B is a linear function of H, $B_i = \mu_{ij}H_j$, where the tensor $\mu_{ij}$ is called
the permeability. As usual, the isotropic case has $\mu_{ij} = \mu\,\delta_{ij}$.

We discuss briefly the relation between B and H. First, let us consider
permanent magnets. These are endowed with a fixed dipole moment M(r)
per unit volume, giving rise to an effective pole volume density −∇ · M
and pole surface density $M_{\mathrm{normal}}$. (We do not discuss the atomic physics
of permanent magnetization.) The equations determining the field
configuration are

$$\nabla\times\mathbf{H} = \frac{4\pi\,\mathbf{j}}{c} = 0 \qquad (2.4.9)$$

and

$$\nabla\cdot\mathbf{B} = 0. \qquad (2.4.2)$$

Thus, since j = 0,

$$\mathbf{H} = -\nabla\phi^*, \qquad (2.4.10)$$

$$\nabla\cdot\mathbf{H} = -4\pi\,\nabla\cdot\mathbf{M}, \qquad (2.4.11)$$

and so

$$\nabla^2\phi^* = 4\pi\,\nabla\cdot\mathbf{M} \qquad (2.4.12)$$

plus boundary conditions determines φ* and, hence, H and B.

Second, let us consider paramagnetic substances: μ > 1. These consist
of atoms possessing permanent magnetic dipole moments, which in the
normal state are randomly oriented, cancel out, and average to zero. The
presence of a magnetic field will polarize the moments and tend to align
them with itself. A simple classical calculation shows this effect. Consider
a gas of atoms, each having a permanent magnetic moment $\boldsymbol{\mu}$, in an
externally applied field $\mathbf{B}_0$. The energy of the magnetic moment in the
field is

$$W = -\boldsymbol{\mu}\cdot\mathbf{B}_0$$

and the Boltzmann distribution function for $\boldsymbol{\mu}$ is

$$f = \frac{e^{\boldsymbol{\mu}\cdot\mathbf{B}_0/kT}}{\int d\Omega\, e^{\boldsymbol{\mu}\cdot\mathbf{B}_0/kT}}. \qquad (2.4.13)$$

The mean value of $\boldsymbol{\mu}$ will be in the direction of $\mathbf{B}_0$, so $\bar{\boldsymbol{\mu}} = \bar{\mu}\,\mathbf{B}_0/B_0$
and

$$\bar{\mu} = \frac{\int d\Omega\,\boldsymbol{\mu}\cdot\mathbf{B}_0\, e^{\boldsymbol{\mu}\cdot\mathbf{B}_0/kT}}{B_0\int d\Omega\, e^{\boldsymbol{\mu}\cdot\mathbf{B}_0/kT}}. \qquad (2.4.14)$$


To obtain some notion of the order of magnitude of $\mu B_0/kT$, we take
μ to be one electron Bohr magneton ($e\hbar/2m_e c$) and measure $B_0$ in tesla
($10^4$ gauss) and T in degrees absolute. We then find


HBo _ | Bo 


, 2.4.15 
kT 2T ( ) 


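so that even for high fields and low temperatures $\mu B_0/kT$ is quite small.
The calculation of $\bar{\mu}$ is very simple for $\mu B_0/kT \ll 1$. From (2.4.14), with
w the cosine of the angle between $\boldsymbol{\mu}$ and $\mathbf{B}_0$,

$$\bar{\mu} = \mu\,\frac{\int_{-1}^{1} w\,e^{\mu B_0 w/kT}\,dw}{\int_{-1}^{1} e^{\mu B_0 w/kT}\,dw} \approx \frac{\mu^2 B_0}{kT}\,\frac{\int_{-1}^{1} w^2\,dw}{\int_{-1}^{1} dw} = \frac{\mu^2 B_0}{3kT}, \qquad (2.4.16)$$

and the atomic paramagnetic polarizability is¹

$$\alpha_P = \frac{\mu^2}{3kT}. \qquad (2.4.17)$$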


Let us see what this volume is. With $\mu = e\hbar/2mc$, where e and m are the
charge and mass, respectively, of an electron,

$$\alpha_P = \frac{1}{3kT}\left(\frac{e\hbar}{2mc}\right)^2$$

$$\phantom{\alpha_P} = \frac{e^2}{3a_B kT}\,\frac{a_B^3}{4}\left(\frac{e^2}{\hbar c}\right)^2 \qquad (2.4.18)$$

$$\phantom{\alpha_P} \approx \frac{1}{T}\,a_B^3, \qquad (2.4.19)$$

with T in degrees absolute. Here, $a_B$ is the Bohr radius, $a_B \approx 0.5\times 10^{-8}$
cm.

¹The formula (2.4.17) is called the Langevin-Debye equation. It evidently holds equally
well for a collection of freely rotating electric dipoles.

Thus, except for very low temperatures, the paramagnetic
susceptibilities are small compared to the electric ones. Note that the high T limit
calculated here is necessary for the classical calculation to be valid; that
is, kT must be larger than the alignment energy $\mu B_0$, since otherwise the
integral over angles, dΩ, would have to be replaced by a sum over discrete
levels.
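The small-field result (2.4.16) can be compared with a direct numerical evaluation of the Boltzmann average (2.4.14). The sketch below is an illustration only: the function names are hypothetical, the closed form coth x − 1/x is the standard Langevin function (not quoted in the text), and the SI constants are rounded values used to estimate $\mu B_0/kT$ for one Bohr magneton at 1 tesla and 1 kelvin.

```python
import math

def mean_moment_fraction(x, n=20000):
    """Return mu_bar/mu for x = mu*B0/(k*T), by midpoint quadrature over w = cos(theta)."""
    num = den = 0.0
    dw = 2.0 / n
    for k in range(n):
        w = -1.0 + (k + 0.5) * dw
        weight = math.exp(x * w)       # Boltzmann factor exp(mu.B0/kT)
        num += w * weight * dw
        den += weight * dw
    return num / den

def langevin(x):
    """Closed form of the same average: coth(x) - 1/x."""
    return 1.0 / math.tanh(x) - 1.0 / x

# Rounded SI constants (assumed values, not taken from the text)
MU_B = 9.274e-24    # Bohr magneton, J/T
K_B = 1.381e-23     # Boltzmann constant, J/K
x_1T_1K = MU_B * 1.0 / (K_B * 1.0)   # mu*B0/kT at B0 = 1 tesla, T = 1 K

print(mean_moment_fraction(0.01), 0.01 / 3)   # small-field limit mu*B0/(3kT)
print(mean_moment_fraction(2.0), langevin(2.0))
print(x_1T_1K)
```

For small x the numerical average approaches x/3, which is the content of (2.4.16); for x of order one the full Langevin function is needed.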

Third, we consider diamagnetic materials. Diamagnetism is present in 
all matter; however, it is dominated by paramagnetism when the latter is 
present. 

Diamagnetism comes from a basic property of magnetic interactions
(called Lenz’s law): They oppose any change in the magnetic field. The
mechanism used is electromagnetic induction, which will be addressed in
Section 2.7.² A careful calculation of the polarizability of an electron orbit
must wait until then. However, we can make a rough estimate of the
order of magnitude. The magnetic moment $\boldsymbol{\mu}$ of an orbiting electron in
an atom, we have seen in (2.3.19), is

$$\boldsymbol{\mu} = \frac{e\mathbf{L}}{2mc}, \qquad (2.4.20)$$

where L is the angular momentum of the electron. L, in turn, will have
a component that is proportional to B and will not average to zero over
many orbits. Dimensionally,

$$L \propto m r^2 \omega, \qquad (2.4.21)$$

where r is the radius of the orbit and ω a frequency of rotation caused
by the magnetic field. This characteristic frequency³ associated with the
magnetic field is

$$\omega = \frac{eB}{mc}, \qquad (2.4.22)$$


giving

$$\mu \sim -\frac{e}{2mc}\,m r^2\,\frac{eB}{mc} \sim -\frac{e^2 r^2}{mc^2}\,B. \qquad (2.4.23)$$

Thus, the diamagnetic polarizability of an atom is very roughly

$$\alpha_D \sim -\frac{e^2}{mc^2}\,a_B^2, \qquad (2.4.24)$$

which is smaller than the electric polarizability by a factor
$(e^2/mc^2)/a_B = (1/137)^2$. It is also smaller than the paramagnetic
polarizability by a factor $T/(137)^2$, so that diamagnetism is normally observed
only in materials for which the paramagnetic susceptibility vanishes. This
occurs when the intrinsic atomic or molecular moment μ [as in (2.4.17)]
vanishes identically for reasons of symmetry.

²In fact, one is treading on dangerous ground in attempting a quantitative classical
calculation of diamagnetism, since it is a famous theorem of classical statistical mechanics
that a temporally constant magnetic field can have no effect on a thermodynamic system at
equilibrium: Paramagnetic and diamagnetic effects cancel. Therefore, some consequences of
quantum mechanics must be added to the calculation. For example, the derivation given
above of (2.4.17) assumes the existence of a permanent unique magnetic moment, which is not
possible in classical mechanics. The derivation to be given in Section 2.7 assumes the
existence of unique orbits and time scales for the field-free motion of the system, which is
again not possible in classical mechanics.

³Think of the equation $m\ddot{\mathbf{r}} = \mathbf{F} + (e/c)\,\dot{\mathbf{r}}\times\mathbf{B}$, and note the characteristic frequency ω
contained in $\ddot{r}/\dot{r}$. F in this equation stands for the sum of the electric forces on the electron.
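The factor $(e^2/mc^2)/a_B = (1/137)^2$ quoted in this estimate can be verified with a few lines of arithmetic. The sketch below is illustrative only; the Gaussian-unit constants are rounded values supplied here, not taken from the text.

```python
# Rounded Gaussian (CGS) constants, assumed for this illustration
E_CHARGE = 4.803e-10   # electron charge magnitude, esu
M_E = 9.109e-28        # electron mass, g
C = 2.998e10           # speed of light, cm/s
HBAR = 1.055e-27       # reduced Planck constant, erg s

a_bohr = HBAR**2 / (M_E * E_CHARGE**2)     # Bohr radius, cm
r_classical = E_CHARGE**2 / (M_E * C**2)   # e^2/(m c^2), cm
alpha = E_CHARGE**2 / (HBAR * C)           # fine-structure constant e^2/(hbar c)

ratio = r_classical / a_bohr               # algebraically identical to alpha**2
print(a_bohr, 1.0 / alpha, ratio, alpha**2)
```

The ratio equals $(e^2/\hbar c)^2$ identically, so the diamagnetic polarizability estimate is suppressed relative to an electric polarizability of order $a_B^3$ by roughly $5\times 10^{-5}$.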


2.5. MOTIONAL ELECTROMOTIVE FORCE AND 
ELECTROMAGNETIC INDUCTION 


If we move a conductor through a magnetic field, the $(e/c)\,\mathbf{v}\times\mathbf{B}$ force will
act on electrons, giving an effective electric field $\mathbf{E}_m = (\mathbf{v}/c)\times\mathbf{B}$. Thus, in
general, if a circuit in a magnetic field is displaced, we can expect an
effective electric field to be generated in the conducting wire, and we will
find an effective electromotive force

$$\mathcal{E}_m = \oint_C \mathbf{E}_m\cdot d\mathbf{l} = \oint_C \frac{\mathbf{v}}{c}\times\mathbf{B}\cdot d\mathbf{l}. \qquad (2.5.1)$$


Note here that v is the velocity of displacement of the circuit element dl.
We define the magnetic flux through the circuit

$$\Phi = \int_S \mathbf{B}\cdot d\mathbf{S}, \qquad (2.5.2)$$

where S is any surface bounded by C, and the normal dS and the direction
of circulation dl are connected by the right-hand rule. Note that since
∇ · B = 0, Φ is independent of the surface S.

As we displace and deform the circuit, the flux Φ will normally change.
We now show that the rate of change determines the effective
electromotive force via the equation

$$\mathcal{E}_m = -\frac{1}{c}\frac{d\Phi}{dt}. \qquad (2.5.3)$$

Note that (2.5.3) is not Faraday’s law, since the magnetic field is not
changing with time.
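With B given by a vector potential A, the flux Φ is

$$\Phi = \int_S \nabla\times\mathbf{A}\cdot d\mathbf{S} = \oint_C \mathbf{A}\cdot d\mathbf{l}, \qquad (2.5.4)$$

and, as we change from contour $C_1$ to contour $C_2$, we have

$$\delta\Phi = \oint_{C_2} \mathbf{A}\cdot d\mathbf{l} - \oint_{C_1} \mathbf{A}\cdot d\mathbf{l}. \qquad (2.5.5)$$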


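We parametrize the path of the contour with a parameter τ such that

$$x_i = x_i(\tau), \qquad 0 \le \tau \le 1, \qquad x_i(1) = x_i(0). \qquad (2.5.6)$$

Then

$$\delta\Phi = \int_0^1 d\tau\, A_i(x+\delta x)\left(\frac{dx_i}{d\tau} + \frac{d\,\delta x_i}{d\tau}\right) - \int_0^1 d\tau\, A_i(x)\,\frac{dx_i}{d\tau} \qquad (2.5.7)$$

$$\phantom{\delta\Phi} = \int_0^1 d\tau\left[A_i(x)\,\frac{d\,\delta x_i}{d\tau} + \frac{\partial A_i}{\partial x_j}\,\delta x_j\,\frac{dx_i}{d\tau}\right]. \qquad (2.5.8)$$

We integrate the first term in (2.5.8) by parts; the integrated term is
zero, because δx and x have the same values at τ = 1 and at τ = 0. There
results

$$\delta\Phi = \int_0^1 d\tau\left(\frac{\partial A_i}{\partial x_j} - \frac{\partial A_j}{\partial x_i}\right)\delta x_j\,\frac{dx_i}{d\tau}. \qquad (2.5.9)$$

We recognize that $(dx_i/d\tau)\,d\tau = dx_i$ (i.e., the $dl_i$ of a line integral) and

$$\left(\frac{\partial A_i}{\partial x_j} - \frac{\partial A_j}{\partial x_i}\right) dx_i\,\delta x_j = \mathbf{B}\cdot\delta\mathbf{x}\times d\mathbf{l}.$$

Equation (2.5.9) then becomes

$$\delta\Phi = -\oint \delta\mathbf{x}\times\mathbf{B}\cdot d\mathbf{l} \qquad (2.5.10)$$

and

$$-\frac{1}{c}\frac{d\Phi}{dt} = \oint \frac{\mathbf{v}}{c}\times\mathbf{B}\cdot d\mathbf{l} = \mathcal{E}_m, \qquad (2.5.11)$$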


which is the equation we set out to prove. 
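The flux rule just proved can be checked numerically in a simple configuration. The sketch below is an illustration under stated assumptions: a static field $B_z(x) = B_0(1 + Kx)$ along z (which has zero divergence), a rigid circular loop in the xy-plane moving along x, units with c = 1, and hypothetical parameter values. It compares $-(1/c)\,d\Phi/dt$, obtained by differencing the flux, with the line integral of $(\mathbf{v}/c)\times\mathbf{B}\cdot d\mathbf{l}$.

```python
import math

B0, K = 1.5, 0.3            # hypothetical field B_z(x) = B0*(1 + K*x); div B = 0
A, V, C = 0.8, 0.2, 1.0     # loop radius, speed along x, speed of light (c = 1 units)

def b_z(x):
    return B0 * (1.0 + K * x)

def flux(x0, nr=200, nt=200):
    """Flux of B_z through a disk of radius A centered at (x0, 0); midpoint rule in polar coordinates."""
    total = 0.0
    dr, dt = A / nr, 2 * math.pi / nt
    for i in range(nr):
        r = (i + 0.5) * dr
        for j in range(nt):
            t = (j + 0.5) * dt
            total += b_z(x0 + r * math.cos(t)) * r * dr * dt
    return total

def motional_emf(x0, n=2000):
    """Line integral of (v/c) x B . dl, counterclockwise so that dS points along +z."""
    total = 0.0
    dt = 2 * math.pi / n
    for j in range(n):
        t = (j + 0.5) * dt
        x = x0 + A * math.cos(t)
        dly = A * math.cos(t) * dt       # y-component of dl
        # v = V x_hat and B = B_z z_hat give v x B = -V * B_z * y_hat
        total += (-V * b_z(x) / C) * dly
    return total

x0, h = 1.0, 1e-3
dphi_dt = V * (flux(x0 + h) - flux(x0 - h)) / (2 * h)   # dPhi/dt = v * dPhi/dx0
emf_from_flux = -dphi_dt / C
emf_direct = motional_emf(x0)
print(emf_from_flux, emf_direct)
```

Both numbers should reproduce the closed form $-B_0 \pi A^2 K V / c$ for this particular field, in line with (2.5.11).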

Faraday’s law of induction was, of course, a great experimental
discovery. Nevertheless, it is interesting to observe that it follows from (2.5.11)
and the assumption of Galilean (or of Lorentz) invariance. Imagine a
magnet and a circuit. Consider two operations. First, move the circuit
between the pole pieces of the magnet. An electromotive force $\mathcal{E}_m$ given
by (2.5.11) will appear in the circuit, a corresponding current $I = \mathcal{E}_m/R$
will flow, and the total charge transferred (e.g., deposited from an
electrolyte or measured by a ballistic galvanometer) will be

$$Q = \int dt\, I = \frac{1}{R}\int dt\,\mathcal{E}_m = -\frac{1}{Rc}\,\Delta\Phi, \qquad (2.5.12)$$


all this no matter how slowly the circuit is moved. Second, move the
magnet past the circuit with equal and opposite velocity. (In the rest frame
of the magnet, this operation looks the same as the first one.) If we are
not to be able to tell one reference system from another, the second
operation must induce an electromotive force in the circuit having the
same value as $\mathcal{E}_m$. In this case, however, the electromotive force is
genuinely induced, as discovered by Faraday. The law is the same as
(2.5.11), of course:

$$\mathcal{E} = -\frac{1}{c}\frac{d\Phi}{dt}, \qquad (2.5.13)$$

where now $\mathcal{E}$ is a true electromotive force and dΦ/dt the rate of change
of flux through the circuit. Obviously, we could have a third operation,
where both the circuit and magnet move in opposite directions. The result
is again the same equation (2.5.13), now with $\mathcal{E}$ the total electromotive
force (true and motional) and dΦ/dt the total change of flux through C,
whether from the motion of C, or a change of magnetic field, or both.

We deduce the differential equation corresponding to the integral
relation (2.5.13) by holding the circuit fixed. Then (2.5.13) is equivalent
to

$$-\frac{1}{c}\frac{\partial}{\partial t}\int_S d\mathbf{S}\cdot\mathbf{B} = \oint_C \mathbf{E}\cdot d\mathbf{l} = \int_S d\mathbf{S}\cdot\nabla\times\mathbf{E} \qquad (2.5.14)$$

for any surface S; hence,

$$\nabla\times\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t}. \qquad (2.5.15)$$

This is the differential form of Faraday’s law. Note that E has now acquired
a curl.


2.6. MAGNETIC ENERGY AND FORCE 


Analogously to our procedure in electrostatics, we calculate the rate at
which the induced electric field acting on the current causes a loss of
magnetic field energy. It is

$$-\frac{dW}{dt} = \int d\mathbf{r}\,\mathbf{E}\cdot\mathbf{j}. \qquad (2.6.1)$$

We transform this expression as follows:

$$= \frac{c}{4\pi}\int d\mathbf{r}\,\mathbf{E}\cdot\nabla\times\mathbf{H} \qquad (2.6.2)$$

and, after dropping a surface term that vanishes, provided E × H goes to
zero faster than $1/r^2$, we obtain

$$-\frac{c}{4\pi}\int d\mathbf{r}\,\mathbf{E}\times\overleftarrow{\nabla}\cdot\mathbf{H}, \qquad (2.6.2)$$

where the symbol $\overleftarrow{\nabla}$ means the gradient differentiates E, but is
algebraically to the right of E. We remedy this position by putting ∇ algebraically
to the left of E and changing the sign. Equation (2.6.2) then becomes

$$\frac{c}{4\pi}\int d\mathbf{r}\,\nabla\times\mathbf{E}\cdot\mathbf{H} = -\frac{1}{4\pi}\int d\mathbf{r}\,\frac{\partial\mathbf{B}}{\partial t}\cdot\mathbf{H} \qquad (2.6.3)$$

for the loss of energy by the field. As in the electric case, we can integrate
(2.6.3), provided the medium is linear and the permeability tensor
symmetric. We find

$$W = \frac{1}{8\pi}\int d\mathbf{r}\,\mathbf{B}\cdot\mathbf{H}. \qquad (2.6.4)$$


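We can discuss systems of current circuits, analogous to conductors
in the electric case. We limit ourselves here to nonmagnetic media. The
analogues to charge and potential are flux and current. To see this, we
introduce the vector potential A:

$$W = \frac{1}{8\pi}\int d\mathbf{r}\,\mathbf{B}\cdot\mathbf{H}
   = \frac{1}{8\pi}\int d\mathbf{r}\,\nabla\times\mathbf{A}\cdot\mathbf{H}
   = \frac{1}{8\pi}\int d\mathbf{r}\,\mathbf{A}\cdot\nabla\times\mathbf{H}
   = \frac{1}{2c}\int d\mathbf{r}\,\mathbf{A}\cdot\mathbf{j},$$

which for a system of circuits becomes

$$W = \frac{1}{2c}\sum_i I_i\oint_{C_i}\mathbf{A}\cdot d\mathbf{l}_i \qquad (2.6.5)$$

or, by Stokes’ theorem,

$$W = \frac{1}{2c}\sum_i \Phi_i I_i, \qquad (2.6.6)$$

where $I_i$ is the current in and $\Phi_i$ the flux through the ith circuit.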
As with charge and potential, the fluxes and currents are linear
functions of each other:

$$\Phi_i = c\sum_j L_{ij} I_j. \qquad (2.6.7)$$

The $L_{ij}$’s are called coefficients of induction. Note that $L_{ij}$ is defined so
that the electromotive force in $C_i$ induced by a changing current in $C_j$ is

$$\mathcal{E}_i = -L_{ij}\,\frac{dI_j}{dt} \quad \text{(no sum)}. \qquad (2.6.8)$$


We list several sometimes useful expressions for the energy W of a
system of current-carrying circuits:

$$W = \frac{1}{2c}\sum_i I_i\Phi_i = \frac{1}{2}\sum_{i,j} L_{ij} I_i I_j \qquad (2.6.9)$$

and defining a matrix $G_{ij}$, which is the inverse of the matrix $L_{ij}$, we have

$$W = \frac{1}{2c^2}\sum_{i,j}\Phi_i G_{ij}\Phi_j. \qquad (2.6.10)$$
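The three forms of the circuit energy, in terms of currents and fluxes (2.6.6), currents alone (2.6.9), and fluxes alone (2.6.10), can be checked against one another for a small example. The sketch below is an illustration only: the 2×2 inductance matrix and the currents are hypothetical numbers, and units with c = 1 are assumed.

```python
C = 1.0   # units with c = 1 for the illustration

L = [[2.0, 0.5],
     [0.5, 1.0]]      # hypothetical symmetric inductance matrix L_ij
I = [3.0, -1.0]       # hypothetical currents I_i

# Phi_i = c * sum_j L_ij I_j, Eq. (2.6.7)
Phi = [C * sum(L[i][j] * I[j] for j in range(2)) for i in range(2)]

# G is the inverse of L (2x2 case written out explicitly)
det = L[0][0] * L[1][1] - L[0][1] * L[1][0]
G = [[ L[1][1] / det, -L[0][1] / det],
     [-L[1][0] / det,  L[0][0] / det]]

W_flux_current = sum(I[i] * Phi[i] for i in range(2)) / (2 * C)
W_currents = 0.5 * sum(L[i][j] * I[i] * I[j] for i in range(2) for j in range(2))
W_fluxes = sum(Phi[i] * G[i][j] * Phi[j] for i in range(2) for j in range(2)) / (2 * C**2)
print(W_flux_current, W_currents, W_fluxes)
```

All three expressions give the same number, as they must when the fluxes and currents are related linearly through (2.6.7).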


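For an extended current j(r), in the absence of a magnetic medium
(μ = 1),

$$W = \frac{1}{2c}\int d\mathbf{r}\,\mathbf{A}\cdot\mathbf{j}
   = \frac{1}{2c^2}\int d\mathbf{r}\,d\mathbf{r}'\,\frac{\mathbf{j}(\mathbf{r})\cdot\mathbf{j}(\mathbf{r}')}{|\mathbf{r}-\mathbf{r}'|}. \qquad (2.6.11)$$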


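The contribution to W in (2.6.11) from two separated circuits, 1 and
2, is

$$W_{12} = \frac{I_1 I_2}{c^2}\oint\oint\frac{d\mathbf{l}_1\cdot d\mathbf{l}_2}{|\mathbf{r}_1-\mathbf{r}_2|},$$

so that

$$L_{12} = \frac{1}{c^2}\oint\oint\frac{d\mathbf{l}_1\cdot d\mathbf{l}_2}{|\mathbf{r}_1-\mathbf{r}_2|}. \qquad (2.6.12)$$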


The idealization of line currents in (2.6.12) cannot be made for the 
diagonal element of the inductance matrix. (See Problem 2.16.) 
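The double line integral for the mutual inductance of two line circuits can be evaluated numerically in a simple geometry. The sketch below is an illustration under stated assumptions: two coaxial circular loops of radii a and b separated by a distance d along the axis, with the overall factor $1/c^2$ left out (the function returns $c^2 L_{12}$, a pure length). For d much larger than a and b it is compared with a dipole estimate built from the on-axis dipole field $2m/d^3$ with $m = I\pi a^2/c$; that comparison and the function names are assumptions made here, not formulas from the text.

```python
import math

def neumann_c2_l12(a, b, d, n=400):
    """c^2 * L_12 for two coaxial circular loops (radii a, b; axial separation d)."""
    total = 0.0
    dt = 2 * math.pi / n
    # By axial symmetry the double integral reduces to 2*pi times a single
    # integral over the relative angle phi between the two line elements.
    for k in range(n):
        phi = (k + 0.5) * dt
        dist = math.sqrt(a * a + b * b - 2 * a * b * math.cos(phi) + d * d)
        total += a * b * math.cos(phi) / dist * dt
    return 2 * math.pi * total

def far_field_c2_l12(a, b, d):
    """Dipole estimate: flux of the on-axis dipole field through the second loop."""
    return 2 * math.pi**2 * a**2 * b**2 / d**3

exact = neumann_c2_l12(1.0, 1.0, 50.0)
estimate = far_field_c2_l12(1.0, 1.0, 50.0)
print(exact, estimate, exact / estimate)
```

The same routine cannot be used for the self-inductance (a = b, d = 0), where the integrand diverges; this is the failure of the line-current idealization for the diagonal elements noted above.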
The generalized force on a circuit is given, with $\xi_k$ a generalized
displacement of the circuit, by

$$F_k = -\left.\frac{\partial W}{\partial \xi_k}\right|_{\text{constant }\Phi} = \left.\frac{\partial W}{\partial \xi_k}\right|_{\text{constant }I}$$

(as we have seen in the electrostatic case). So

$$F_k = \frac{1}{2}\sum_{i,j} I_i I_j\,\frac{\partial L_{ij}}{\partial \xi_k}, \qquad (2.6.13)$$

in agreement with (2.2.21).

As in the electric case, the calculation of forces on material bodies is
difficult. As in the electric case, however, we can find the total force on
all the current (free or bound) in a region surrounded by vacuum as
follows. The total force on currents inside the volume V is

$$\mathbf{F} = \frac{1}{c}\int_V d\mathbf{r}\,\mathbf{j}\times\mathbf{B}, \qquad (2.6.14)$$

which we transform into

$$\mathbf{F} = \frac{1}{4\pi}\int d\mathbf{r}\,(\nabla\times\mathbf{B})\times\mathbf{B}.$$

In tensor notation,

$$F_k = \frac{1}{4\pi}\int d\mathbf{r}\,(B_i\partial_i B_k - B_i\partial_k B_i). \qquad (2.6.15)$$

Since $\partial_i B_i = 0$, we rewrite (2.6.15) as

$$F_k = \frac{1}{4\pi}\int d\mathbf{r}\,\partial_i\left(B_i B_k - \frac{\delta_{ik}}{2}B^2\right), \qquad (2.6.16)$$

leading to a magnetic stress tensor

$$T_{ik} = \frac{1}{4\pi}\left(B_i B_k - \delta_{ik}\frac{B^2}{2}\right) \qquad (2.6.17)$$

and a total force on the currents inside V

$$F_k = \int_S dS_i\, T_{ik}, \qquad (2.6.18)$$

where S is the surface enclosing V. $dS_i$ is, of course, the outward normal.
Since the surface S in (2.6.18) is in vacuum, the microscopic and
macroscopic B are equal.


2.7. DIAMAGNETISM 


We return now to the question of diamagnetism, which was discussed 
qualitatively in Section 2.4. 

We consider an isolated atom, containing n electrons and a heavy
nucleus. Imagine applying a constant magnetic field $\mathbf{B}_0$ to the atom. A
famous result, due to Larmor, is that, to first order in $B_0$, the system looks
just like the same system with $B_0 = 0$, but rotating gently with a rotational
angular velocity (called Larmor precession)

$$\boldsymbol{\omega}_L = -\frac{e\mathbf{B}_0}{2m_e c}, \qquad (2.7.1)$$

where e is the charge of the electron (negative!) and $m_e$ its mass. We
prove the result by considering the equation of motion for each electron
e:

$$m_e\frac{d^2\mathbf{r}_e}{dt^2} = \mathbf{F}_e + e\,\frac{\mathbf{v}_e}{c}\times\mathbf{B}_0, \qquad (2.7.2)$$

where $\mathbf{F}_e$ is the total remaining force on the eth electron, assumed
velocity-independent and angular-momentum conserving, thus, for example,
electrostatic.

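What happens when we rotate the system about a point with a fixed
angular velocity ω? We specify a vector A as

$$\mathbf{A} = \hat{\imath}A_x + \hat{\jmath}A_y + \hat{k}A_z, \qquad (2.7.3)$$

where $A_x$, $A_y$, $A_z$ are the components of A along the rotating axes. The
rotation of the coordinate axes is described as

$$\frac{d\hat{\imath}}{dt} = \boldsymbol{\omega}\times\hat{\imath}, \qquad \frac{d\hat{\jmath}}{dt} = \boldsymbol{\omega}\times\hat{\jmath}, \qquad \frac{d\hat{k}}{dt} = \boldsymbol{\omega}\times\hat{k}, \qquad (2.7.4)$$

so that

$$\frac{d\mathbf{A}}{dt} = \boldsymbol{\omega}\times\mathbf{A} + \frac{d\mathbf{A}}{d[t]}, \qquad (2.7.5)$$

where

$$\frac{d\mathbf{A}}{d[t]} = \hat{\imath}\frac{dA_x}{dt} + \hat{\jmath}\frac{dA_y}{dt} + \hat{k}\frac{dA_z}{dt}. \qquad (2.7.6)$$

That is, dA/d[t] is the rate of change of A seen by an observer in the
rotating coordinate system. Applying (2.7.5) to (2.7.2), we have

$$m_e\left(\frac{d^2\mathbf{r}_e}{d[t]^2} + 2\boldsymbol{\omega}\times\frac{d\mathbf{r}_e}{d[t]} + \boldsymbol{\omega}\times(\boldsymbol{\omega}\times\mathbf{r}_e)\right) = \mathbf{F}_e + \frac{e}{c}\left[\frac{d\mathbf{r}_e}{d[t]} + \boldsymbol{\omega}\times\mathbf{r}_e\right]\times\mathbf{B}_0. \qquad (2.7.7)$$

If we set $\boldsymbol{\omega} = \boldsymbol{\omega}_L = -e\mathbf{B}_0/2m_e c$ and neglect quadratic terms in ω or B, we
return to the B-less equation

$$m_e\frac{d^2\mathbf{r}_e}{d[t]^2} = \mathbf{F}_e, \qquad (2.7.8)$$

as stated earlier. Thus, the solutions of (2.7.8) for the components of the
$\mathbf{r}_e$’s are as if there were no field and no rotation. The neglect of quadratic
terms is justified if $\omega_L r \ll dr/d[t]$ and $\omega_L\,dr/d[t] \ll d^2r/d[t]^2$, which for
ordinary atoms and magnetic fields is true.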
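To see how well the condition for neglecting quadratic terms is satisfied, one can compare the Larmor frequency with a typical atomic orbital frequency. The sketch below is an order-of-magnitude illustration: the Gaussian-unit constants are rounded values supplied here, and the atomic frequency $m_e e^4/\hbar^3$ (electron speed $\alpha c$ divided by the Bohr radius) is an assumed stand-in, not a value from the text.

```python
# Rounded Gaussian (CGS) constants, assumed for this illustration
E_CHARGE = 4.803e-10   # electron charge magnitude, esu
M_E = 9.109e-28        # electron mass, g
C = 2.998e10           # speed of light, cm/s
HBAR = 1.055e-27       # reduced Planck constant, erg s

def larmor_frequency(b_gauss):
    """Magnitude of the Larmor angular velocity e*B0/(2*m_e*c), in rad/s."""
    return E_CHARGE * b_gauss / (2 * M_E * C)

# Rough atomic orbital frequency v/r with v = alpha*c and r = Bohr radius
omega_atomic = M_E * E_CHARGE**4 / HBAR**3

omega_l = larmor_frequency(1.0e4)     # B0 = 10^4 gauss (1 tesla)
ratio = omega_l / omega_atomic
print(omega_l, omega_atomic, ratio)
```

Even at $10^4$ gauss the ratio is of order $10^{-6}$, so terms quadratic in $\omega_L$ are negligible for ordinary atoms and laboratory fields.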

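The velocity of each electron is now

$$\mathbf{v}_e = \frac{d\mathbf{r}_e}{d[t]} + \boldsymbol{\omega}_L\times\mathbf{r}_e; \qquad (2.7.9)$$

the angular momentum of each electron is

$$\mathbf{l}_e = m_e\mathbf{r}_e\times\frac{d\mathbf{r}_e}{dt} = m_e\mathbf{r}_e\times\frac{d\mathbf{r}_e}{d[t]} + m_e\mathbf{r}_e\times(\boldsymbol{\omega}_L\times\mathbf{r}_e) \qquad (2.7.10)$$

and the magnetic moment of each electron orbit is

$$\mathbf{m}_e = \frac{e}{2m_e c}\,\mathbf{l}_e = \frac{e}{2c}\,\mathbf{r}_e\times\frac{d\mathbf{r}_e}{d[t]} + \frac{e}{2c}\,\mathbf{r}_e\times(\boldsymbol{\omega}_L\times\mathbf{r}_e). \qquad (2.7.11)$$

Of course, we have taken the origin of r (i.e., the center of rotation) at
the center of the atom. Adding up all the electrons in the atom and
averaging over atoms [assuming that the first term on the right-hand side
of (2.7.11) averages to zero], we have for the average dipole moment per
atom m

$$\mathbf{m} = -\frac{e^2}{4m_e c^2}\sum_e\overline{\left(r_e^2\,\mathbf{B}_0 - \mathbf{r}_e\cdot\mathbf{B}_0\,\mathbf{r}_e\right)} \qquad (2.7.12)$$

$$\phantom{\mathbf{m}} = -\frac{e^2}{6m_e c^2}\left(\sum_e\overline{r_e^2}\right)\mathbf{B}_0 \qquad (2.7.13)$$

for a diamagnetic polarizability

$$\alpha_D = -\frac{e^2}{6m_e c^2}\sum_e\overline{r_e^2}, \qquad (2.7.14)$$

in agreement with our earlier estimate.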
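The step from (2.7.12) to (2.7.13) rests on the angular average of $r^2\mathbf{B}_0 - \mathbf{r}\,(\mathbf{r}\cdot\mathbf{B}_0)$ over directions of r, which should equal $\tfrac{2}{3}r^2\mathbf{B}_0$ for an isotropic distribution. The sketch below checks this by direct quadrature over the sphere; the isotropy assumption, the choice of $\mathbf{B}_0$ along z, and the function name are choices made for this illustration.

```python
import math

def averaged_bracket(r, b, n_theta=200, n_phi=200):
    """Average of r^2 B - r (r.B) over directions of r, for fixed |r| and B = (0, 0, b)."""
    acc = [0.0, 0.0, 0.0]
    d_theta, d_phi = math.pi / n_theta, 2 * math.pi / n_phi
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            rx = r * math.sin(theta) * math.cos(phi)
            ry = r * math.sin(theta) * math.sin(phi)
            rz = r * math.cos(theta)
            r_dot_b = rz * b
            w = math.sin(theta) * d_theta * d_phi / (4 * math.pi)  # solid-angle weight
            acc[0] += w * (0.0 - rx * r_dot_b)
            acc[1] += w * (0.0 - ry * r_dot_b)
            acc[2] += w * (r * r * b - rz * r_dot_b)
    return acc

avg = averaged_bracket(r=1.5, b=2.0)
print(avg, (2.0 / 3.0) * 1.5**2 * 2.0)
```

The transverse components average to zero and the component along $\mathbf{B}_0$ reproduces the factor 2/3, which turns the prefactor $e^2/4m_ec^2$ into $e^2/6m_ec^2$.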

It remains to be shown that as the magnetic field is turned on, starting 
from zero, the induced electric field converts the original motion of the 
system to the Larmor precessing motion. 

We proceed by deriving a differential equation for the time depen- 
dence of the magnetic moment of an atom, averaged over many atoms 
and many cycles of the fast atomic motion. 

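The equation governing the motion of the electrons in an atom is

$$m\frac{d^2\mathbf{r}_e}{dt^2} = \mathbf{F}_e + e\left(\frac{\mathbf{v}_e}{c}\times\mathbf{B}_e + \mathbf{E}_e\right), \qquad (2.7.15)$$

where $\mathbf{F}_e$ is the force of the nucleus and other electrons on e, $\mathbf{B}_e$ the
magnetic field acting on e, and $\mathbf{E}_e$ the induced electric field acting on e.
We neglect nuclear motion and take $\mathbf{r}_e$ to be the radius vector from the
nucleus to e.

We take the cross product of $\mathbf{r}_e$ with (2.7.15) and sum over e. Since
the force $\mathbf{F}_e$ is angular-momentum conserving, it drops out and we have
the equation

$$\frac{d\mathbf{L}}{dt} = e\sum_e\mathbf{r}_e\times\left(\frac{\mathbf{v}_e}{c}\times\mathbf{B}_e + \mathbf{E}_e\right), \qquad (2.7.16)$$

where L is the total angular momentum of the electrons about the nucleus.
The triple vector product $\mathbf{r}_e\times(\mathbf{v}_e\times\mathbf{B}_e)$ is

$$\mathbf{r}_e\times(\mathbf{v}_e\times\mathbf{B}_e) = \mathbf{r}_e\cdot\mathbf{B}_e\,\mathbf{v}_e - \mathbf{r}_e\cdot\mathbf{v}_e\,\mathbf{B}_e$$

$$= \frac{\mathbf{r}_e\cdot\mathbf{B}_e\,\mathbf{v}_e + \mathbf{B}_e\cdot\mathbf{v}_e\,\mathbf{r}_e}{2} + \frac{\mathbf{r}_e\cdot\mathbf{B}_e\,\mathbf{v}_e - \mathbf{B}_e\cdot\mathbf{v}_e\,\mathbf{r}_e}{2} - \mathbf{r}_e\cdot\mathbf{v}_e\,\mathbf{B}_e$$

$$= \frac{1}{2}(\mathbf{r}_e\times\mathbf{v}_e)\times\mathbf{B}_e + \frac{1}{2}\frac{d}{dt}\left(\mathbf{r}_e\,\mathbf{r}_e\cdot\mathbf{B}_e - r_e^2\,\mathbf{B}_e\right) - \frac{1}{2}\left(\mathbf{r}_e\,\mathbf{r}_e\cdot\frac{d\mathbf{B}_e}{dt} - r_e^2\,\frac{d\mathbf{B}_e}{dt}\right), \qquad (2.7.17)$$

so that

$$\frac{d\mathbf{L}}{dt} = \frac{e}{2mc}\sum_e(\mathbf{l}_e\times\mathbf{B}_e) + \frac{e}{2c}\frac{d}{dt}\sum_e\left(\mathbf{r}_e\,\mathbf{r}_e\cdot\mathbf{B}_e - r_e^2\,\mathbf{B}_e\right) + e\sum_e\left(\mathbf{r}_e\times\mathbf{E}_e - \frac{1}{2c}\left(\mathbf{r}_e\,\mathbf{r}_e\cdot\frac{d\mathbf{B}_e}{dt} - r_e^2\,\frac{d\mathbf{B}_e}{dt}\right)\right). \qquad (2.7.18)$$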


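We simplify (2.7.18) by assuming that the spatial variation of B is
small over the atom, so that $\mathbf{B}_e$ can be replaced by B, and $d\mathbf{B}_e/dt$ by ∂B/∂t.
We cannot do the same for $\mathbf{E}_e$, since the coefficient $\sum_e\mathbf{r}_e$ will average to
zero over many cycles of the fast motion and over many atoms. We
therefore expand $\mathbf{E}_e$ about the nucleus of the atom:

$$\mathbf{E}_e = \mathbf{E}_{\mathrm{nucleus}} + \frac{1}{2c}\,\mathbf{r}_e\times\frac{\partial\mathbf{B}}{\partial t} + \delta\mathbf{E}, \qquad (2.7.19)$$

where δE is linear in $\mathbf{r}_e$, but such that $\sum_e\mathbf{r}_e\times\delta\mathbf{E}$
averages to zero (see Problem 2.17). The result is

$$\frac{d\mathbf{L}}{dt} = \frac{e}{2mc}\,\mathbf{L}\times\mathbf{B} + \frac{e}{2c}\frac{d}{dt}\sum_e\left(\mathbf{r}_e\,\mathbf{r}_e\cdot\mathbf{B} - r_e^2\,\mathbf{B}\right)
 + e\sum_e\left[\mathbf{r}_e\times\left(\frac{1}{2c}\,\mathbf{r}_e\times\frac{\partial\mathbf{B}}{\partial t}\right) - \frac{1}{2c}\left(\mathbf{r}_e\,\mathbf{r}_e\cdot\frac{\partial\mathbf{B}}{\partial t} - r_e^2\,\frac{\partial\mathbf{B}}{\partial t}\right)\right] + e\sum_e\mathbf{r}_e\times\delta\mathbf{E} \qquad (2.7.20)$$

or, since the square bracket vanishes identically by the triple-product
identity $\mathbf{a}\times(\mathbf{a}\times\mathbf{b}) = \mathbf{a}\,\mathbf{a}\cdot\mathbf{b} - a^2\mathbf{b}$,

$$\frac{d\mathbf{M}}{dt} = \boldsymbol{\omega}_L\times\mathbf{M} + \frac{d}{dt}\,\frac{e^2}{4mc^2}\sum_e\left(\mathbf{r}_e\,\mathbf{r}_e\cdot\mathbf{B} - r_e^2\,\mathbf{B}\right) + \frac{e^2}{2mc}\sum_e\mathbf{r}_e\times\delta\mathbf{E}. \qquad (2.7.21)$$

We average over atoms and find

$$\frac{d\mathbf{M}}{dt} = \boldsymbol{\omega}_L\times\mathbf{M} - \frac{d}{dt}\left(\frac{e^2}{6mc^2}\sum_e\overline{r_e^2}\,\mathbf{B}\right), \qquad (2.7.22)$$

where $\mathbf{M} = \sum_e\mathbf{m}_e$ is the total magnetic dipole moment, M = eL/2mc.
We can solve (2.7.22) easily for a field B in one direction, say, z. In
that case, starting with $M_z$ and B that are both zero, we find

$$M_z = -\frac{e^2}{6mc^2}\sum_e\overline{r_e^2}\,B_z, \qquad (2.7.23)$$

confirming our earlier guess, (2.7.14). Evidently, the component of M in
the x,y plane precesses around the field with the precession frequency
$\omega_L$, as expected.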


CHAPTER 2 PROBLEMS 


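2.1. Show that the functions $\rho(\mathbf{x},t) = q f(\mathbf{x}-\mathbf{x}_p(t))$ and $\mathbf{j}(\mathbf{x},t) =
q\,(d\mathbf{x}_p/dt)\,f(\mathbf{x}-\mathbf{x}_p(t))$, with $\mathbf{x}_p(t)$ an arbitrary function of t, satisfy the
continuity equation

$$\frac{\partial\rho}{\partial t} + \nabla\cdot\mathbf{j} = 0$$

and are therefore possible candidates for charge and current density.
For a point particle, the function $f(\mathbf{x}-\mathbf{x}_p(t))$ would go over to
$\delta(\mathbf{x}-\mathbf{x}_p(t))$.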


2.2. Two closed metal surfaces are immersed in a conducting fluid. Con- 
struct and prove a uniqueness theorem for the current and field. 


2.3. Two metal spheres, one very small, the other of radius b, are
immersed in a conducting medium of conductivity σ. The centers
of the spheres are separated by a distance L. The small sphere has
a current $I_0$ flowing out of it; the sphere of radius b is maintained
at a potential V with respect to a very large conductor containing
the system. What is the total current I flowing into the large sphere?


2.4. Current I enters an “infinite” thin conducting plane (of conductivity
σ) at a point and departs at infinity. Assuming the flow is uniform
across the width A of the thin plane, give a formula for the
electrostatic potential at a point ρ, φ in the plane.


2.5. A circular hole of radius a is cut from the plane of Problem 2.4.
Its center is a distance b > a from the entrance point of the current.
Using the method of Problem 1.21, find a formula for the
electrostatic potential at ρ, φ.


2.6. A current I of uniform current density flows down a circular
cylindrical wire of radius b. Using Ampère’s circuital law, find the magnetic
field at a distance ρ from the center of the wire, for ρ < b and
ρ > b.


2.7. Imagine a uniform current $J_0$ flowing in the z direction for
$|\boldsymbol{\rho}-\boldsymbol{\rho}_1| < b$ and in the −z direction for $|\boldsymbol{\rho}-\boldsymbol{\rho}_2| < b$, where
$|\boldsymbol{\rho}_1-\boldsymbol{\rho}_2| < b$. Draw a picture of the resultant current distribution
and give a formula for the (uniform) magnetic field in the overlap
area. Now let $|\boldsymbol{\rho}_1-\boldsymbol{\rho}_2| \to 0$ and $J_0 \to \infty$ so that $J_0|\boldsymbol{\rho}_1-\boldsymbol{\rho}_2|$ remains
finite. Give a formula for the resultant current distribution as a
function of ρ and φ.


2.8. Calculate the field B inside and outside of a perfect solenoid, that
is, an infinite thin cylinder of radius a carrying a uniform circulating
current density $\mathbf{j} = \hat{k}\times\hat{\rho}\,J\,\delta(\rho - a)$, where $\hat{\rho}$ is the unit radial vector
from the axis of the cylinder to the point ρ. $\hat{k}$ is a unit vector parallel
to the axis of the cylinder. Check that your answer is consistent with
Ampère’s circuital law (2.2.33).


2.9. Calculate the B and H fields of a magnetized spherical shell of radius
b with a constant dipole moment per unit volume M.


2.10. Calculate the B and H fields on the axis of a circular cylindrical
magnet of radius a and length h with a constant dipole moment per
unit volume M. Give the fields both inside and outside the magnet.


2.11. A spherical shell of radius R carries a uniform charge distribution
of surface charge density σ. The shell is rotated about an axis with
constant angular velocity ω. Find the magnetic field B inside and
outside the sphere.

Suppose you wish to wind a current-carrying wire around a
sphere in such a way that the field inside the sphere is uniform.
How should the wire be wound?


2.12. Study the next approximation in r′/r to the magnetic field of a
confined current. That is, from the general formula

$$\mathbf{A}(\mathbf{r}) = \frac{1}{c}\int d\mathbf{r}'\,\frac{\mathbf{j}(\mathbf{r}')}{|\mathbf{r}-\mathbf{r}'|}$$

find the term following the dipole field. In particular, find the
magnetic scalar potential and show that it is a quadrupole field.


2.13. A current distribution J(r) exists in vacuum in the space z > 0, all
x and y. The space z < 0 is filled with a medium of permeability μ.
Show that a B field satisfying the field equation and boundary
conditions is given by the following:

(a) For z > 0, the unperturbed B field of the current J(r), to which
must be added the B field of an image current L(r) with

$$L_x(x, y, z) = \frac{\mu-1}{\mu+1}\,J_x(x, y, -z),$$

$$L_y(x, y, z) = \frac{\mu-1}{\mu+1}\,J_y(x, y, -z),$$

and

$$L_z(x, y, z) = -\frac{\mu-1}{\mu+1}\,J_z(x, y, -z).$$

(b) The B field for z < 0 is given by the unperturbed field of an
image current $\frac{2\mu}{1+\mu}\,\mathbf{J}(\mathbf{r})$.

(c) The equations do not appear at first sight to lead to a unique
solution. What makes the solution unique?


*2.14. A superconductor behaves like a perfect diamagnet, that is, a
material with μ = 0. An interesting property of a superconducting
sphere is that it is repelled by a magnetic field and, therefore, can
be balanced above a magnet.

(a) Simulate the magnetic field as resulting from a very long
circular cylindrical magnet of radius b and uniform magnetization
M, with the −M pole far enough away to neglect. The
superconducting sphere, with radius a, is placed at a vertical height
h above the center of the positive face of the magnet. Calculate
the force on the sphere. Take a ≪ h and carry out a multipole
expansion of the B field generated by the superconducting
sphere in the presence of the magnet. Then integrate the stress
tensor around the sphere to get the force. You will have to
keep the dipole and quadrupole terms to obtain a nonvanishing
magnetic force.

(b) Study the vertical stability of the suspension.

(c) Study the horizontal stability of the suspension. (Answer:
Stable for $b/h > \sqrt{24}$.)


2.15. Assume that force F is invariant (up to and including terms of order
v/c) to a change of coordinate system going from one observer to
another (the primed observer) with relative velocity v. From this,
assuming E′ = E + O(v/c), B′ = B + O(v/c), and the Lorentz force
law, show that

$$\mathbf{E}' = \mathbf{E} + \frac{\mathbf{v}}{c}\times\mathbf{B} + O\!\left(\frac{v^2}{c^2}\right).$$


2.16. A current is carried by a wire of radius r. The wire is in the form
of a circular loop, of radius a, with a ≫ r.

(a) Calculate the self-inductance L of the loop accurate to order
1/log(a/r).

(b) From this result, estimate the tension in the wire when carrying
a current I. Make sure your answer is a tension.


2.17. Complete the argument leading from (2.7.18) to (2.7.20) by showing
that $\sum_e\mathbf{r}_e\times\delta\mathbf{E}$, where δE is linear in $\mathbf{r}_e$, averages to zero over many
cycles of fast motion and over many atoms.


2.18. Solve Eq. (2.7.21) for the $\mathbf{M}_\perp$, the components of M perpendicular
to $\boldsymbol{\omega}_L$, given the initial value of $\mathbf{M}_\perp$. Assume $\boldsymbol{\omega}_L$ is in a fixed
direction, but has arbitrary time dependence.


CHAPTER 3


Time-Dependent Fields 
and Currents 


3.1. MAXWELL’S EQUATIONS 


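The static and quasistatic (including electromagnetic induction) equations
for E are

$$\nabla\times\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t} \qquad (3.1.1)$$

and

$$\nabla\cdot\mathbf{E} = 4\pi\rho; \qquad (3.1.2)$$

for B they are

$$\nabla\cdot\mathbf{B} = 0 \qquad (3.1.3)$$

and

$$\nabla\times\mathbf{B} = \frac{4\pi\,\mathbf{j}}{c}. \qquad (3.1.4)$$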


Equations (3.1.3) and (3.1.1) are clearly consistent; however, if ρ is
time-dependent, the continuity equation, (2.1.2), tells us that ∇ · j will not
vanish, and (3.1.4) will be inconsistent. We repair this inconsistency in
the simplest possible way. We add a term 4πj′/c to the right-hand side of
(3.1.4):

$$\nabla\times\mathbf{B} = \frac{4\pi}{c}\,(\mathbf{j} + \mathbf{j}'). \qquad (3.1.5)$$

Consistency requires

$$\nabla\cdot(\mathbf{j} + \mathbf{j}') = 0$$

or, from the continuity equation,

$$\nabla\cdot\mathbf{j}' = \frac{\partial\rho}{\partial t}. \qquad (3.1.6)$$

Assuming (again, the simplest assumption) that (3.1.2) still holds, we find

$$\nabla\cdot\mathbf{j}' = \frac{1}{4\pi}\frac{\partial}{\partial t}\,\nabla\cdot\mathbf{E} = \nabla\cdot\left(\frac{1}{4\pi}\frac{\partial\mathbf{E}}{\partial t}\right), \qquad (3.1.7)$$

from which

$$\mathbf{j}' = \frac{1}{4\pi}\frac{\partial\mathbf{E}}{\partial t} + \nabla\times\mathbf{Q}, \qquad (3.1.8)$$

where Q can be any vector. Again, the simplest assumption is Q = 0. This
yields a new, consistent equation to replace (3.1.4):

$$\nabla\times\mathbf{B} = \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} + \frac{4\pi\,\mathbf{j}}{c}. \qquad (3.1.9)$$

Maxwell called the new current j′ the displacement current. Together with
j, the “total” current is “conserved,” in the sense that no flux of j + j′
emerges from a closed surface, that is, ∇ · (j + j′) = 0.
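As a short check of the consistency claimed here, one can take the divergence of the new equation (3.1.9) and use (3.1.2). This is a worked restatement of the argument above, not an additional result:

```latex
\begin{aligned}
0 = \nabla\cdot(\nabla\times\mathbf{B})
  &= \frac{1}{c}\,\frac{\partial}{\partial t}\,\nabla\cdot\mathbf{E}
     + \frac{4\pi}{c}\,\nabla\cdot\mathbf{j} \\
  &= \frac{4\pi}{c}\left(\frac{\partial\rho}{\partial t} + \nabla\cdot\mathbf{j}\right),
\end{aligned}
```

so that (3.1.9) is consistent precisely when the continuity equation holds, whereas the divergence of (3.1.4) would force ∇ · j = 0.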

Maxwell’s full equations require us to look in a new way at the causal
relationships between the fields.

First, observe that two of the equations involve time derivatives:

$$\frac{\partial\mathbf{E}}{\partial t} = c\,\nabla\times\mathbf{B} - 4\pi\,\mathbf{j} \qquad (3.1.10)$$

and

$$\frac{\partial\mathbf{B}}{\partial t} = -c\,\nabla\times\mathbf{E}. \qquad (3.1.11)$$

It follows that the right-hand sides of (3.1.10) and (3.1.11) tell us how E
and B change. This is not the way the static equations led us to think:
Current was viewed as causing a magnetic field and changing magnetic
field as causing an electromotive force. When we come to the study of
radiation, we will see how our normal way of thinking about causality
can be restored. Nevertheless, as a mathematical physics boundary value
problem, one must think of (3.1.10) and (3.1.11) as calculating ∂B/∂t and
∂E/∂t, given B, E, and j as functions of position at an initial time.

Second, observe that taking the divergence of (3.1.10) and (3.1.11)
leads to the time derivatives of the divergence equations (3.1.2) and
(3.1.3). Thus, from (3.1.10),

$$\frac{\partial}{\partial t}\,\nabla\cdot\mathbf{E} = -4\pi\,\nabla\cdot\mathbf{j} = 4\pi\,\frac{\partial\rho}{\partial t} \quad\text{or}\quad \frac{\partial}{\partial t}\,(\nabla\cdot\mathbf{E} - 4\pi\rho) = 0, \qquad (3.1.12)$$

and from (3.1.11),

$$\frac{\partial}{\partial t}\,\nabla\cdot\mathbf{B} = 0, \qquad (3.1.13)$$

so that provided the divergence equations hold at one time, the equations
for ∂E/∂t and ∂B/∂t will guarantee that they hold for all time. Clearly,
therefore, the divergence equations should be viewed as enforcing certain
boundary conditions on the E and B fields, whereas the time derivative
equations are the dynamical equations. Just as in most physical theories,
one formulates the time-dependent problem as the prediction of the future
(or past) state from the present. For example, in mechanics, we specify r
and dr/dt, and then predict the future course of r and dr/dt via Newton’s
second law. Similarly, in quantum theory, we give the wave function at
one time and use the Schrödinger equation to predict its value at another
time. We see that the analogous problem in electrodynamics is to give E
and B at one time, subject to the constraints ∇ · E = 4πρ and ∇ · B = 0.
Maxwell’s equations then predict the future (or past) values of E and B.

This assumes, of course, that ρ and j are given functions of space
and time, satisfying the continuity equation ∇·j + (∂ρ/∂t) = 0. When we
consider the interaction of particles and fields as complete dynamical
systems, ρ and j can no longer be considered as given functions. Their
time dependence must be calculated as well. We will, of course, come
back to this. For the moment, however, we suppose that the sources ρ
and j are composed of heavy objects (magnets, large capacitors, atomic 
nuclei, etc.) on which the reaction of the fields can be ignored. 
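
The claim that the dynamical equations preserve the divergence constraints rests on the identity ∇·(∇×E) = 0, which makes the time derivative of ∇·B vanish in (3.1.13). The following Python sketch checks a lattice analogue of that identity. The periodic lattice, its size `N`, and the forward-difference operators are assumptions made only for this illustration; they do not come from the text.

```python
import random

N = 4  # lattice points per side (periodic); an arbitrary small value

def diff(field, axis):
    """Forward difference of a scalar lattice field along `axis` (periodic)."""
    out = {}
    for (i, j, k), value in field.items():
        nxt = [i, j, k]
        nxt[axis] = (nxt[axis] + 1) % N
        out[(i, j, k)] = field[tuple(nxt)] - value
    return out

def sub(a, b):
    """Pointwise difference of two scalar lattice fields."""
    return {p: a[p] - b[p] for p in a}

def curl(F):
    """Discrete curl of a vector field F = (Fx, Fy, Fz)."""
    Fx, Fy, Fz = F
    return (sub(diff(Fz, 1), diff(Fy, 2)),
            sub(diff(Fx, 2), diff(Fz, 0)),
            sub(diff(Fy, 0), diff(Fx, 1)))

def div(F):
    """Discrete divergence of a vector field F = (Fx, Fy, Fz)."""
    dx, dy, dz = diff(F[0], 0), diff(F[1], 1), diff(F[2], 2)
    return {p: dx[p] + dy[p] + dz[p] for p in dx}

def random_field(rng):
    """Scalar field with independent random values on every lattice site."""
    return {(i, j, k): rng.uniform(-1.0, 1.0)
            for i in range(N) for j in range(N) for k in range(N)}

rng = random.Random(0)
E = tuple(random_field(rng) for _ in range(3))

# dB/dt = -c curl E, so d(div B)/dt is proportional to div(curl E).
max_div_curl = max(abs(v) for v in div(curl(E)).values())
print("largest |div curl E| on the lattice:", max_div_curl)
```

Because the difference operators along different axes commute, the divergence of the curl cancels term by term up to rounding, even for a random field. A constraint imposed at the initial time is therefore carried along by the update rule and is not an independent equation of motion.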
3.2. ELECTROMAGNETIC FIELDS IN MATTER


In order to obtain equations for the macroscopic fields, we proceed to 
average the microscopic fields e and b, as we did in Sections 1.6 and 2.4. 
In our earlier discussions of electric and magnetic fields in matter, we 
relied heavily on the explicit form of the integrals giving the static fields 
as functions of the charge and current densities. Since those integrals no 
longer hold in the time-dependent regime we are now considering, we 
must proceed differently. Instead of working with the solutions, we work 
directly with the differential equations for the microscopic fields. These 
are first the homogeneous equations 


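$$\nabla\cdot\mathbf{b} = 0 \qquad (3.2.1)$$

and

$$\nabla\times\mathbf{e} = -\frac{1}{c}\frac{\partial\mathbf{b}}{\partial t}. \qquad (3.2.2)$$

The spatially averaged fields B and E clearly satisfy the same
equations:

$$\nabla\cdot\mathbf{B} = 0 \qquad (3.2.3)$$

and

$$\nabla\times\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t}. \qquad (3.2.4)$$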


Note that we average our fields over space, but not time. The space
average is necessary to smooth the fluctuations of the microscopic fields 
in going from atom to atom. Therefore, the averaging function must 
extend over a volume that contains many atoms. On the other hand, we 
need the time resolution to be fine enough to describe light emitted by 
atoms—that is, to be finer than a characteristic atomic time. In dealing 
with normal atomic phenomena, there is, therefore, no need for, and 
nothing to be gained by, a time average. As emphasized in Section 1.6,
the averaging volume, although large enough to contain
many atoms, must be small enough to resolve the distances we wish to
study. These requirements are easily compatible for visible light and a gas 
at normal temperature and pressure, as we show below, following (3.2.34). 

The inhomogeneous microscopic equations 
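$$\nabla\cdot\mathbf{e} = 4\pi\rho_m \qquad (3.2.5)$$

and

$$\nabla\times\mathbf{b} = \frac{1}{c}\frac{\partial\mathbf{e}}{\partial t} + \frac{4\pi}{c}\,\mathbf{j}_m \qquad (3.2.6)$$

are averaged to give

$$\nabla\cdot\mathbf{E} = 4\pi\bar\rho_m \qquad (3.2.7)$$

and

$$\nabla\times\mathbf{B} = \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} + \frac{4\pi}{c}\,\bar{\mathbf{j}}_m. \qquad (3.2.8)$$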


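In order to carry out the required averages, we again separate $\rho_m$ and
$\mathbf{j}_m$ into bound and free components:

$$\rho_m = \rho_f + \rho_b \qquad (3.2.9)$$

and

$$\mathbf{j}_m = \mathbf{j}_f + \mathbf{j}_b. \qquad (3.2.10)$$

$\bar\rho_f$ and $\bar{\mathbf{j}}_f$ will be the macroscopically observed charge and current
densities:

$$\bar\rho_f = \rho \qquad (3.2.11)$$

and

$$\bar{\mathbf{j}}_f = \mathbf{j}. \qquad (3.2.12)$$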


We continue with our model of Section 1.6 in which $\rho_b$ and $\mathbf{j}_b$ are assumed
to come from neutral atoms or molecules. Therefore, we can write

$$\rho_b = \sum_n \rho_n(\mathbf{r} - \mathbf{r}_n, t) \qquad (3.2.13)$$

where $\mathbf{r}_n$ is the location of the nth neutral atom or molecule. $\rho_n(\mathbf{x})$ falls
rapidly to zero for x larger than $a_0$, an atomic radius, and $\int d\mathbf{x}\,\rho_n(\mathbf{x}) = 0$.
Similarly,

$$\mathbf{j}_b = \sum_n \mathbf{j}_n(\mathbf{r} - \mathbf{r}_n, t). \qquad (3.2.14)$$

However, here, as we shall see, we cannot [as we did in (2.3.3)] require
$\int \mathbf{j}_n(\mathbf{x})\,d\mathbf{x} = 0$.
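We now average:

$$\bar\rho_b = \sum_n \int d\mathbf{r}'\, f(\mathbf{r} - \mathbf{r}')\,\rho_n(\mathbf{r}' - \mathbf{r}_n)$$

or, with $\mathbf{r}' - \mathbf{r}_n = \mathbf{x}$,

$$\bar\rho_b = \sum_n \int d\mathbf{x}\, f(\mathbf{r} - \mathbf{r}_n - \mathbf{x})\,\rho_n(\mathbf{x}). \qquad (3.2.15)$$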


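Since x is restricted by $\rho_n$ to be $\lesssim a_0$, and f, the averaging function,
varies on a scale that includes many atoms, we expand f in powers of x,
up to and including the linear term. The first term vanishes by the assumed
neutrality of $\rho_n$. The second term gives

$$\bar\rho_b = -\sum_n \int d\mathbf{x}\,\mathbf{x}\,\rho_n(\mathbf{x})\cdot\nabla_{\mathbf{r}} f(\mathbf{r}-\mathbf{r}_n) \qquad (3.2.16)$$

or

$$\bar\rho_b = -\nabla\cdot\mathbf{P}(\mathbf{r}) \qquad (3.2.17)$$

where

$$\mathbf{P}(\mathbf{r}) = \sum_n f(\mathbf{r}-\mathbf{r}_n)\,\mathbf{p}_n. \qquad (3.2.18)$$

$\mathbf{p}_n$ is the dipole moment of the nth atom and P the average dipole moment
per unit volume.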

We may neglect the next term in the expansion, since we will be 
considering electric and magnetic fields that are weak enough that a linear 
theory suffices. Thus, the next term has a contribution that is independent 
of the electric field and does not contribute to our linear equation, plus a 
contribution which is linear in the electric field and does but is negligible 
compared to (3.2.16) for the reasons given in Section 1.6. 

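For $\bar{\mathbf{j}}_b$ we carry out a similar expansion:

$$\bar{\mathbf{j}}_b = \sum_n \int d\mathbf{x}\, f(\mathbf{r}-\mathbf{r}_n-\mathbf{x})\,\mathbf{j}_n(\mathbf{x}) \qquad (3.2.19)$$

$$= \sum_n f(\mathbf{r}-\mathbf{r}_n)\int d\mathbf{x}\,\mathbf{j}_n(\mathbf{x}) - \int d\mathbf{x}\,\mathbf{x}\cdot\nabla_{\mathbf{r}} \sum_n f(\mathbf{r}-\mathbf{r}_n)\,\mathbf{j}_n(\mathbf{x}). \qquad (3.2.20)$$

The first term in (3.2.20) involves

$$\int d\mathbf{x}\,\mathbf{j}_n = -\int d\mathbf{x}\,\mathbf{x}\,\nabla\cdot\mathbf{j}_n = \int d\mathbf{x}\,\mathbf{x}\,\frac{\partial\rho_n}{\partial t} = \frac{d\mathbf{p}_n}{dt} \qquad (3.2.21)$$

so that the first term becomes

$$\sum_n f(\mathbf{r}-\mathbf{r}_n)\,\frac{d\mathbf{p}_n}{dt} = \frac{\partial\mathbf{P}}{\partial t}. \qquad (3.2.22)$$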


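The second term we write using a tensorial notation. Its ith component
is

$$\left(\bar{\mathbf{j}}_b^{(2)}\right)_i = -\sum_n \frac{\partial f(\mathbf{r}-\mathbf{r}_n)}{\partial r_k}\int x_k\, j_{ni}(\mathbf{x})\,d\mathbf{x}. \qquad (3.2.23)$$

We decompose $x_k j_{ni}$ into a symmetric and antisymmetric part. We neglect
the symmetric part for the same reasons as those given above following
(3.2.18). The antisymmetric part will, of course, give our usual magnetic
dipole density. Thus, (3.2.23), antisymmetrized, becomes

$$\left(\bar{\mathbf{j}}_b^{(2)}\right)_i = -\frac{1}{2}\sum_n \frac{\partial f}{\partial r_k}\int \left(x_k\, j_{ni} - x_i\, j_{nk}\right) d\mathbf{x} \qquad (3.2.24)$$

$$\bar{\mathbf{j}}_b^{(2)} = -\frac{1}{2}\sum_n \int d\mathbf{x}\left[\mathbf{j}_n\,(\mathbf{x}\cdot\nabla_{\mathbf{r}} f) - \mathbf{x}\,(\mathbf{j}_n\cdot\nabla_{\mathbf{r}} f)\right] = \frac{1}{2}\sum_n \nabla_{\mathbf{r}} f(\mathbf{r}-\mathbf{r}_n)\times\int d\mathbf{x}\,(\mathbf{x}\times\mathbf{j}_n) \qquad (3.2.25)$$

or

$$\bar{\mathbf{j}}_b^{(2)} = c\sum_n \nabla_{\mathbf{r}} f(\mathbf{r}-\mathbf{r}_n)\times\mathbf{m}_n, \qquad (3.2.26)$$

where $\mathbf{m}_n$ is the magnetic moment of the nth atom. Equation (3.2.26)
then leads to

$$\bar{\mathbf{j}}_b^{(2)} = c\,\nabla\times\sum_n f(\mathbf{r}-\mathbf{r}_n)\,\mathbf{m}_n = c\,\nabla\times\mathbf{M} \qquad (3.2.27)$$

where M is the magnetic moment per unit volume. Thus, we have

$$\bar\rho_b = -\nabla\cdot\mathbf{P} \quad\text{and}\quad \bar{\mathbf{j}}_b = \frac{\partial\mathbf{P}}{\partial t} + c\,\nabla\times\mathbf{M}. \qquad (3.2.28)$$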
t 




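The averaged inhomogeneous equations are therefore

$$\nabla\cdot\mathbf{E} = 4\pi\rho - 4\pi\,\nabla\cdot\mathbf{P} \qquad (3.2.29)$$

and

$$\nabla\times\mathbf{B} = \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} + \frac{4\pi}{c}\,\mathbf{j} + \frac{4\pi}{c}\frac{\partial\mathbf{P}}{\partial t} + 4\pi\,\nabla\times\mathbf{M} \qquad (3.2.30)$$

so, introducing the D and H fields, we have

$$\nabla\cdot\mathbf{D} = 4\pi\rho \qquad (3.2.31)$$

and

$$\nabla\times\mathbf{H} = \frac{1}{c}\frac{\partial\mathbf{D}}{\partial t} + \frac{4\pi}{c}\,\mathbf{j}. \qquad (3.2.32)$$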


Equations (3.2.31) and (3.2.32), together with the homogeneous
equations (3.2.3) and (3.2.4), determine the boundary conditions to be
imposed at a material discontinuity:

$$\Delta D_{\text{normal}} = 4\pi\sigma \quad (\sigma = \text{surface charge})$$

$$\Delta B_{\text{normal}} = 0$$

$$\Delta E_{\text{tangential}} = 0 \qquad (3.2.33)$$

and

$$\Delta H_{\text{tangential}} = \frac{4\pi}{c}\,K \quad (K = \text{surface current}).$$


Together with the constitutive relations between E and D on the one 
hand, and H and B on the other, these equations and boundary conditions 
determine the time dependence of the fields. However, unlike the matter- 
free case, the fields at one time in the presence of matter do not determine 
the fields at all later times. This is because there is a finite time and space 
lag between the imposition of an electric field and the appearance of a 
nonvanishing polarization. Consequently, the relation between the
macroscopic electric field and the polarization will be

$$P_i(\mathbf{r}, t) = \int_{-\infty}^{t} dt'\,\chi_{ij}(\mathbf{r}, t-t')\,E_j(\mathbf{r}, t'), \qquad (3.2.34)$$

where $\chi_{ij}$ is the susceptibility tensor at position r. The relationship between
E and P will be linear, as indicated, for fields that are weak compared
with the internal fields in matter ($e/a_0^2 \sim 5\times10^{9}$ V/cm). The space lag,
whose scale would be set by atomic sizes, $\sim 10^{-8}$ cm, is left out of (3.2.34),
because our fields are averaged over a volume containing many atoms, so
that whatever space lag existed would be washed out.

That χ depends only on t − t′ assumes that the effect of the field E
at time t′ on the polarization at time t only depends on the difference
t − t′, in a way that is independent of the time t′. We say (3.2.34) is
time-translation-invariant. Equation (3.2.34) shows that specifying E at one
time is not, in general, enough information to determine E at later times, 
since the integral over t' requires E to be known at all previous times. 
There are still physically obvious sets of consistent initial conditions, but 
they depend explicitly on the geometry under consideration. In addition, 
they require specifying the entire previous history of the system. A simple 
example is discussed in Section 3.5 and Problem 3.14. 

The t − t′ dependence of χ is governed by the characteristic frequencies
of atomic motion. Since these are the same frequencies that atoms
radiate, it is necessary to keep track of the temporal nonlocality of χ.
Note that if we wish to describe the spatial variation of the fields,
consistency requires that our averaging process allow us to resolve the length
scale of wavelengths, which, in turn, must be much larger than atomic
lengths to justify our assumption of spatial locality. This is the case for
atomic radiation. With f the radiation frequency and λ its wavelength,
we have, very approximately, $\lambda \sim c/f \sim 4\pi c/(e^2/a_0\hbar) \sim 1000$ Å. On the
other hand, a 1000 Å cube of gas contains about $10^{4}$ atoms at normal
temperature and pressure.
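
The order-of-magnitude statements in this passage can be checked with a few lines of arithmetic. The sketch below is only an illustration: the rounded Gaussian-unit constants, the use of the Loschmidt number (an ideal gas at 0 °C and 1 atm) for "normal temperature and pressure", and the function names are all assumptions introduced here.

```python
import math

# Rounded constants in Gaussian (CGS) units; values are assumptions of this sketch.
E_CHARGE = 4.803e-10       # electron charge, esu
A_BOHR = 5.292e-9          # Bohr radius a_0, cm
HBAR = 1.055e-27           # erg s
C_LIGHT = 2.998e10         # cm/s
LOSCHMIDT = 2.687e19       # molecules per cm^3, ideal gas at 0 C and 1 atm
STATVOLT_TO_VOLT = 299.79  # 1 statvolt expressed in volts

def atomic_wavelength_cm():
    """lambda ~ 4 pi c / (e^2 / (a_0 hbar)), the estimate quoted in the text."""
    omega_scale = E_CHARGE**2 / (A_BOHR * HBAR)
    return 4.0 * math.pi * C_LIGHT / omega_scale

def atoms_in_cube(side_cm):
    """Number of gas molecules in a cube of the given side."""
    return LOSCHMIDT * side_cm**3

def internal_field_volts_per_cm():
    """e / a_0^2 converted from statvolt/cm to V/cm."""
    return E_CHARGE / A_BOHR**2 * STATVOLT_TO_VOLT

print("wavelength (Angstrom):", atomic_wavelength_cm() / 1e-8)
print("atoms in a 1000 Angstrom cube:", atoms_in_cube(1000e-8))
print("internal field (V/cm):", internal_field_volts_per_cm())
```

With these inputs the wavelength comes out slightly below 1000 Å, the cube holds a few times $10^4$ molecules, and $e/a_0^2$ is about $5\times10^{9}$ V/cm, so an averaging volume can contain many atoms and still be small on the scale of the wavelength.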

Equation (3.2.34) invites a Fourier transform. With 


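$$E_i(t) = \frac{1}{\sqrt{2\pi}}\int d\omega\, e^{-i\omega t}\,\tilde{E}_i(\omega) \qquad (3.2.35)$$

and

$$P_i(t) = \frac{1}{\sqrt{2\pi}}\int d\omega\, e^{-i\omega t}\,\tilde{P}_i(\omega) \qquad (3.2.36)$$

we have

$$\tilde{P}_i(\mathbf{r}, \omega) = \tilde\chi_{ij}(\mathbf{r}, \omega)\,\tilde{E}_j(\mathbf{r}, \omega) \qquad (3.2.37)$$

where

$$\tilde\chi_{ij}(\mathbf{r}, \omega) = \int_{-\infty}^{\infty} dt\,\chi_{ij}(\mathbf{r}, t)\, e^{i\omega t}. \qquad (3.2.38)$$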
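It follows that

$$\tilde{D}_i(\mathbf{r}, \omega) = \epsilon_{ij}(\mathbf{r}, \omega)\,\tilde{E}_j(\mathbf{r}, \omega) \qquad (3.2.39)$$

where $\epsilon_{ij}$, the dielectric tensor, is given by

$$\epsilon_{ij} = \delta_{ij} + 4\pi\tilde\chi_{ij} \qquad (3.2.40)$$

and, of course,

$$\mathbf{D}(\mathbf{r}, t) = \mathbf{E}(\mathbf{r}, t) + 4\pi\mathbf{P}(\mathbf{r}, t).$$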


Since the boundary conditions (3.2.33) must hold at all times, they must 
also hold for the Fourier transformed fields. 

The rules for complex conjugation of the Fourier transformed fields 
and dielectric constants follow directly from the reality of the fields them- 
selves: 


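$$\tilde{\mathbf{E}}(\omega)^* = \tilde{\mathbf{E}}(-\omega), \text{ etc.} \qquad (3.2.41)$$

and

$$\epsilon_{ij}(\omega)^* = \epsilon_{ij}(-\omega). \qquad (3.2.42)$$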
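
As a concrete check of (3.2.38) and of the conjugation rule (3.2.42), the sketch below uses a single-exponential response, χ(t) = (χ₀/τ) e^(−t/τ) for t ≥ 0 and zero for t < 0. This response function, the parameter values, and the function names are assumptions chosen for illustration; the text does not specify any particular χ(t).

```python
import cmath
import math

CHI0 = 0.5  # static susceptibility (illustrative value)
TAU = 2.0   # relaxation time (arbitrary units)

def chi_t(t):
    """Assumed causal response: exponential decay for t >= 0, zero before."""
    return (CHI0 / TAU) * math.exp(-t / TAU) if t >= 0.0 else 0.0

def chi_omega_numeric(omega, t_max=80.0, steps=80000):
    """Midpoint-rule evaluation of the integral of chi(t) exp(i omega t) over t >= 0."""
    h = t_max / steps
    total = 0j
    for n in range(steps):
        t = (n + 0.5) * h
        total += chi_t(t) * cmath.exp(1j * omega * t)
    return total * h

def chi_omega_exact(omega):
    """Closed form of the same integral: chi0 / (1 - i omega tau)."""
    return CHI0 / (1.0 - 1j * omega * TAU)

omega = 0.7
print(chi_omega_numeric(omega), chi_omega_exact(omega))
print(chi_omega_exact(-omega), chi_omega_exact(omega).conjugate())
```

The closed form has its only pole at ω = −i/τ, in the lower half-plane, and satisfies χ̃(ω)* = χ̃(−ω) because χ(t) is real.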


An important property of the dielectric constant follows from the
presumption that if the field E is zero before t = 0, the polarization P and
displacement D(t) should also be zero before t = 0. If E(t) = 0 for t < 0,
then

$$\tilde{\mathbf{E}}(\omega) = \frac{1}{\sqrt{2\pi}}\int_0^{\infty} dt'\, e^{i\omega t'}\,\mathbf{E}(t') \qquad (3.2.43)$$

and so $\tilde{\mathbf{E}}(\omega)$ is analytic in the upper half ω plane.¹ Conversely, if $\tilde{\mathbf{E}}(\omega)$ is
analytic in the upper half ω plane, then

$$\mathbf{E}(t) = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} d\omega\, e^{-i\omega t}\,\tilde{\mathbf{E}}(\omega) \qquad (3.2.44)$$

vanishes for t < 0. This can be seen from (3.2.44) by closing the ω contour
in the upper half-plane, presuming $\tilde{\mathbf{E}}(\omega)$ to go to zero sufficiently rapidly
as ω → ∞.

¹Strictly speaking, $\tilde{\mathbf{E}}(\omega)$ for real ω is the boundary value, as ω approaches
the real axis, of a function that is analytic in the upper half ω plane. We see
here for the first time the importance of analytic functions to notions of causality.
Similarly,

$$\tilde{\mathbf{D}}(\omega) = \frac{1}{\sqrt{2\pi}}\int_0^{\infty} dt'\, e^{i\omega t'}\,\mathbf{D}(t') \qquad (3.2.45)$$

so that $\tilde{\mathbf{D}}(\omega)$ must also be analytic in the upper half ω plane. Finally, since

$$\tilde{D}_i(\omega) = \epsilon_{ij}(\omega)\,\tilde{E}_j(\omega), \qquad (3.2.46)$$

$\epsilon_{ij}(\omega)$ must also be analytic in the upper half-plane. One can rule out
upper half-plane poles of ε, since they would have to be compensated by
zeroes of $\tilde{\mathbf{E}}$. However, $\tilde{\mathbf{E}}(\omega)$ is essentially arbitrary (except for its
analyticity properties) and cannot be required to have zeroes at a
predetermined value of ω. Remarkably, the upper half-plane analyticity of ε(ω)
is almost sufficient to guarantee causal propagation in a material medium 
(i.e., signal propagation with velocity limited by light velocity c). It must 
be supplemented only by the requirement that field energy can be lost, 
but not gained from the medium. This is shown in Problems 3.13 and 
3.14. We turn in the next section to a discussion of field energy in a 
dielectric. 
We consider the force on charges and currents in the absence of dielectric
matter. The total force on charges and currents inside a volume V is 


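$$F_i = \int_V d\mathbf{r}\left[\rho E_i + \frac{1}{c}(\mathbf{j}\times\mathbf{B})_i\right]. \qquad (3.3.1)$$

We substitute ∇·(E/4π) for ρ and c(∇×B/4π) − (1/4π) ∂E/∂t for j to
obtain

$$F_i = \int_V d\mathbf{r}\left[\frac{(\nabla\cdot\mathbf{E})\,E_i}{4\pi} - \frac{1}{4\pi c}\left(\frac{\partial\mathbf{E}}{\partial t}\times\mathbf{B}\right)_i - \frac{1}{4\pi}\left[\mathbf{B}\times(\nabla\times\mathbf{B})\right]_i\right]. \qquad (3.3.2)$$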

We follow a by now familiar path: 


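$$E_i\,\partial_k E_k = \partial_k(E_i E_k) - E_k(\partial_k E_i - \partial_i E_k) - E_k\,\partial_i E_k, \qquad (3.3.3)$$

$$[\mathbf{B}\times(\nabla\times\mathbf{B})]_i = B_k\,\partial_i B_k - B_k\,\partial_k B_i = \partial_i\frac{B^2}{2} - \partial_k(B_k B_i), \qquad (3.3.4)$$

and

$$E_k(\partial_k E_i - \partial_i E_k) = -[\mathbf{E}\times(\nabla\times\mathbf{E})]_i = \frac{1}{c}\left(\mathbf{E}\times\frac{\partial\mathbf{B}}{\partial t}\right)_i. \qquad (3.3.5)$$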


Putting it all together, we obtain 


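$$F_i = -\frac{1}{4\pi c}\int_V d\mathbf{r}\left(\frac{\partial\mathbf{E}}{\partial t}\times\mathbf{B} + \mathbf{E}\times\frac{\partial\mathbf{B}}{\partial t}\right)_i + \int_V d\mathbf{r}\,\partial_k T_{ik} \qquad (3.3.6)$$

where $T_{ik}$ is the sum of the familiar electric and magnetic stress tensors
(1.3.19) and (2.6.17). The new term on the right of (3.3.6) is the time
derivative of a vector

$$\mathbf{P}_{em} = \int_V d\mathbf{r}\,\frac{\mathbf{E}\times\mathbf{B}}{4\pi c}. \qquad (3.3.7)$$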


Transpose $-d\mathbf{P}_{em}/dt$ to the left-hand side of (3.3.6) and recognize that
$\mathbf{F} = d\mathbf{p}_m/dt$, where $\mathbf{p}_m$ is the material momentum of the charges and
currents acted on by the fields.² Thus, we have

$$\frac{d}{dt}\left(\mathbf{p}_m + \mathbf{P}_{em}\right)_i = \int_S dS_k\, T_{ik} \qquad (3.3.8)$$

which clearly identifies $\mathbf{P}_{em}$ as the electromagnetic momentum contained
in the volume V.
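
To make (3.3.7) concrete, the sketch below evaluates the momentum density E × B/(4πc) for a vacuum plane wave and compares it with the energy density divided by c. The field amplitude, the choice of axes, and the helper names are assumptions of this example; the vacuum energy density (E² + B²)/8π is the ε = μ = 1 case of the standard Gaussian-unit expression.

```python
import math

C_LIGHT = 2.998e10  # cm/s (rounded)

def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def momentum_density(E, B):
    """Integrand of (3.3.7): E x B / (4 pi c)."""
    return tuple(x / (4.0 * math.pi * C_LIGHT) for x in cross(E, B))

def energy_density(E, B):
    """Vacuum field energy density (E^2 + B^2) / (8 pi)."""
    e2 = sum(x * x for x in E)
    b2 = sum(x * x for x in B)
    return (e2 + b2) / (8.0 * math.pi)

# Assumed snapshot of a vacuum plane wave travelling along +z: |E| = |B|, E along x, B along y.
E0 = 3.0
E = (E0, 0.0, 0.0)
B = (0.0, E0, 0.0)

g = momentum_density(E, B)
u = energy_density(E, B)
print("momentum density:", g)
print("energy density / c:", u / C_LIGHT)
```

For this configuration the momentum density points along the propagation direction and its magnitude equals the energy density divided by c, as expected for a freely propagating wave.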

We turn next to energy. In this case, we prefer to keep the possibility 
of describing dielectric media. The dynamical equations are then 


$$\nabla\times\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t} \qquad (3.3.9)$$

and

$$\nabla\times\mathbf{H} = \frac{1}{c}\frac{\partial\mathbf{D}}{\partial t} + \frac{4\pi}{c}\,\mathbf{j}. \qquad (3.3.10)$$

²We know that this statement holds for nonrelativistic systems. For relativistic systems,
the final equation (3.3.8) is correct, although the force defined as $d\mathbf{p}_m/dt$ does not have
simple relativistic properties. We will come back to this issue when we discuss relativity in
Chapter 6.


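Now dot H into (3.3.9), E into (3.3.10), and subtract. There results

$$\frac{1}{c}\left(\mathbf{E}\cdot\frac{\partial\mathbf{D}}{\partial t} + \mathbf{H}\cdot\frac{\partial\mathbf{B}}{\partial t}\right) = -\frac{4\pi}{c}\,\mathbf{j}\cdot\mathbf{E} + \left(\mathbf{E}\cdot\nabla\times\mathbf{H} - \mathbf{H}\cdot\nabla\times\mathbf{E}\right) \qquad (3.3.11)$$

or, if we integrate over a volume V,

$$-\frac{1}{4\pi}\int_V d\mathbf{r}\left(\mathbf{E}\cdot\frac{\partial\mathbf{D}}{\partial t} + \mathbf{H}\cdot\frac{\partial\mathbf{B}}{\partial t}\right) = \int_V d\mathbf{r}\,\mathbf{j}\cdot\mathbf{E} + \frac{c}{4\pi}\int_V d\mathbf{r}\,\nabla\cdot(\mathbf{E}\times\mathbf{H}) \qquad (3.3.12)$$

$$= \int_V d\mathbf{r}\,\mathbf{j}\cdot\mathbf{E} + \int_S \boldsymbol{\mathcal{S}}\cdot d\mathbf{S} \qquad (3.3.13)$$

where $\boldsymbol{\mathcal{S}}$, the Poynting vector, is

$$\boldsymbol{\mathcal{S}} = \frac{c}{4\pi}\,\mathbf{E}\times\mathbf{H}. \qquad (3.3.14)$$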


Equation (3.3.13) clearly is an energy balance equation: On the left is the 
rate at which field energy is lost, on the right the two loss mechanisms, 
doing work on charges and escaping through the surface surrounding V. 
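The increase in electrical field energy (including possible absorption by
matter) is

$$\delta W = \frac{1}{4\pi}\int d\mathbf{r}\,\mathbf{E}\cdot\delta\mathbf{D} \qquad (3.3.15)$$

with an analogous term for magnetic energy. For a material with a
symmetric, time-independent susceptibility tensor

$$\delta D_i = \epsilon_{ij}\,\delta E_j \qquad (3.3.16)$$

and

$$\frac{E_i\,\delta D_i}{4\pi} = \delta\left(\frac{E_i E_j \epsilon_{ij}}{8\pi}\right) \qquad (3.3.17)$$

so that (3.3.15) can be integrated to give

$$W = \frac{1}{8\pi}\int d\mathbf{r}\, E_i E_j \epsilon_{ij}. \qquad (3.3.18)$$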


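To study the general case, we consider a situation in which the field
is cycled from zero to zero, and integrate δW to see if energy is absorbed
in the process. The total absorption will be

$$\Delta W = \frac{1}{4\pi}\int_{-\infty}^{\infty} dt \int d\mathbf{r}\,\mathbf{E}\cdot\frac{\partial\mathbf{D}}{\partial t} \qquad (3.3.19)$$

and, for passive matter, must be nonnegative. We evaluate (3.3.19) by
inserting Fourier transforms for E and D:

$$\Delta W = \frac{1}{4\pi}\int d\mathbf{r}\int_{-\infty}^{\infty} dt \int\frac{d\omega}{\sqrt{2\pi}}\,\tilde{E}_i(\omega)\,e^{-i\omega t}\,\frac{\partial}{\partial t}\int\frac{d\omega'}{\sqrt{2\pi}}\,\tilde{D}_i(\omega')\,e^{-i\omega' t} \qquad (3.3.20)$$

or

$$\Delta W = \frac{1}{4\pi}\int d\mathbf{r}\int d\omega\; i\omega\,\tilde{E}_i(\omega)\,\epsilon_{ik}(-\omega)\,\tilde{E}_k(-\omega). \qquad (3.3.21)$$

We introduce the real and imaginary parts of $\tilde{E}_i$ and $\epsilon_{ik}$:

$$\tilde{E}_i(\omega) = R_i(\omega) + iI_i(\omega) \qquad (3.3.22)$$

and

$$\epsilon_{ik} = \epsilon^R_{ik} + i\epsilon^I_{ik}. \qquad (3.3.23)$$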


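It follows from (3.2.41) and (3.2.42) that $R_i$ and $\epsilon^R_{ik}$ are even functions of
ω and $I_i$ and $\epsilon^I_{ik}$ are odd functions of ω. Thus,

$$\Delta W = \frac{1}{2\pi}\int d\mathbf{r}\int_0^{\infty} d\omega\;\omega\,\tilde{E}_j^*(\omega)\,\frac{\epsilon_{jk}(\omega) - \epsilon^*_{kj}(\omega)}{2i}\,\tilde{E}_k(\omega). \qquad (3.3.24)$$

We see that absorption comes (for ω > 0) from the anti-hermitian part of
$\epsilon_{jk}$:

$$(\epsilon_a)_{jk} = \frac{1}{2i}\left(\epsilon_{jk} - \epsilon^*_{kj}\right), \qquad (3.3.25)$$

which must be a nonnegative matrix, that is,

$$f_j^*\,(\epsilon_a)_{jk}\,f_k \ge 0 \qquad (3.3.26)$$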


for any vector f. 
For the special case of a scalar dielectric constant ε, (3.3.26) becomes

$$\mathrm{Im}\,\epsilon(\omega) \ge 0 \qquad (3.3.27)$$

for ω > 0.

Upper half-plane analyticity and imaginary part positivity on the real
axis further restrict the properties of ε.

First, they imply that ε(ω) has no zeros in the upper half ω plane.
Second, they relate the signs of the real part and the imaginary part of
√ε on the real axis: They must be equal. (See Problems 3.13 and 3.14.)

We shall see in the next section an example in which the upper
half-plane analyticity of ε(ω) is closely related to the positivity of $\epsilon_a$. This is
not surprising since both these properties are related to the passivity of
matter. Clearly, if there is power being put into matter, then the
absorption ΔW can be negative, and the polarization density in the medium can
precede the application of the electric field.
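
The properties listed above can be checked on a simple model. The sketch below uses a damped single-resonance (Lorentz oscillator) form, ε(ω) = 1 + ω_p²/(ω₀² − ω² − iγω). This model and its parameter values are assumptions introduced for illustration only; the text derives its own expression from atomic perturbation theory in Section 3.4.

```python
import cmath

OMEGA_0 = 1.0  # resonance frequency (arbitrary units)
GAMMA = 0.1    # damping rate; positive for a passive medium
OMEGA_P = 0.8  # strength parameter

def epsilon(omega):
    """Assumed Lorentz-oscillator dielectric constant; omega may be complex."""
    return 1.0 + OMEGA_P**2 / (OMEGA_0**2 - omega**2 - 1j * GAMMA * omega)

def poles():
    """Roots of omega_0^2 - omega^2 - i gamma omega = 0."""
    root = cmath.sqrt(OMEGA_0**2 - GAMMA**2 / 4.0)
    return (-0.5j * GAMMA + root, -0.5j * GAMMA - root)

def passive_on_real_axis(omegas):
    """Check Im eps > 0, equal signs of Re and Im of sqrt(eps), and (3.2.42)."""
    for w in omegas:
        eps = epsilon(w)
        n = cmath.sqrt(eps)
        if eps.imag <= 0.0 or n.real * n.imag <= 0.0:
            return False
        if abs(epsilon(-w) - eps.conjugate()) > 1e-12:
            return False
    return True

print("poles:", poles())
print("passive for 0 < omega < 4:", passive_on_real_axis([0.1 * k for k in range(1, 40)]))
```

Both poles lie in the lower half-plane as long as γ > 0, and the same sign of γ makes Im ε positive for ω > 0. Reversing the sign of γ would move the poles into the upper half-plane and make Im ε negative, which is the active-medium situation described at the end of the section.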


3.4. POLARIZABILITY AND ABSORPTION 
BY ATOMIC SYSTEMS 


We consider an atom in a uniform but time-dependent electric field, and
calculate the Schrödinger wave function $\psi(\mathbf{r}_1, \ldots, \mathbf{r}_n, t)$ for the atom,
where $\mathbf{r}_1, \ldots, \mathbf{r}_n$ are the coordinates of the electrons in the atom. The
neglect of the motion of the nucleus is a very good approximation and
does not affect the conclusions we shall draw.

The applied field is

$$\mathbf{E}(t) = \int\frac{d\omega}{\sqrt{2\pi}}\, e^{-i\omega t}\,\tilde{\mathbf{E}}(\omega)$$

and the Hamiltonian is

$$H = H_0 + H_1 \qquad (3.4.1)$$

where $H_1 = -\mathbf{E}\cdot\mathbf{X}$ and X is the electric dipole operator of the atom:

$$\mathbf{X} = e\sum_i \mathbf{r}_i. \qquad (3.4.2)$$

We try to solve the Schrödinger equation

$$-\frac{\hbar}{i}\frac{\partial\psi}{\partial t} = H\psi \qquad (3.4.3)$$

in the presence of E. Since we are looking for linear effects, we will use
first-order perturbation theory to calculate ψ and then use ψ to calculate
the expected value of the dipole operator X:

$$\langle\mathbf{X}\rangle = \int \psi^*(\mathbf{r}_1, \ldots, \mathbf{r}_n)\,\mathbf{X}\,\psi(\mathbf{r}_1, \ldots, \mathbf{r}_n)\,d\mathbf{r}_1 \cdots d\mathbf{r}_n. \qquad (3.4.4)$$


We solve (3.4.3) by assuming³ $\psi = \psi_0 + \psi_1$, where $\psi_1$ is first-order in
E. $\psi_0$ satisfies the equation

$$H_0\psi_0 = -\frac{\hbar}{i}\frac{\partial\psi_0}{\partial t} \qquad (3.4.5)$$

and will be taken here as the ground state wave function, $u_0\,e^{-iW_0t/\hbar}$, for
which

$$H_0u_0 = W_0u_0. \qquad (3.4.6)$$

Of course, this is appropriate only for a system at a temperature T such
that no significant excitation is present.
We rewrite (3.4.3) as

$$-\frac{\hbar}{i}\frac{\partial\psi_0}{\partial t} - \frac{\hbar}{i}\frac{\partial\psi_1}{\partial t} = (H_0 + H_1)(\psi_0 + \psi_1). \qquad (3.4.7)$$

The zeroth-order term is satisfied by (3.4.5); the first-order equation is

$$-\frac{\hbar}{i}\frac{\partial\psi_1}{\partial t} = H_0\psi_1 + H_1\psi_0. \qquad (3.4.8)$$

³We take the ground state to be nondegenerate.


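To solve (3.4.8), we expand $\psi_1$ in the complete set of eigenfunctions of
$H_0$:

$$\psi_1 = \sum_n c_n(t)\,u_n. \qquad (3.4.9)$$

Since $H_0u_n = W_nu_n$, our equation (3.4.8) becomes

$$\sum_n\left(-\frac{\hbar}{i}\frac{\partial c_n}{\partial t} - W_nc_n\right)u_n = -\mathbf{X}\cdot\int\frac{d\omega}{\sqrt{2\pi}}\,\tilde{\mathbf{E}}(\omega)\,u_0\,e^{-i(\omega + W_0/\hbar)t}. \qquad (3.4.10)$$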


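We look for solutions

$$c_n = \int_{-\infty}^{\infty} d\omega\, e^{-i(\omega + W_0/\hbar)t}\,d_n(\omega) \qquad (3.4.11)$$

from which

$$\sum_n (W_0 - W_n + \hbar\omega)\,d_n(\omega)\,u_n = -\frac{\mathbf{X}\cdot\tilde{\mathbf{E}}(\omega)}{\sqrt{2\pi}}\,u_0 \qquad (3.4.12)$$

and hence

$$d_n(\omega) = -\frac{\int d\tau\,u_n^*\,\mathbf{X}\,u_0\cdot\tilde{\mathbf{E}}(\omega)}{\sqrt{2\pi}\,(W_0 - W_n + \hbar\omega)}$$

and

$$\psi_1 = -\int d\omega\, e^{-i(\omega + W_0/\hbar)t}\sum_n u_n\,\frac{\int d\tau\,u_n^*\,\mathbf{X}\,u_0\cdot\tilde{\mathbf{E}}(\omega)}{\sqrt{2\pi}\,(W_0 - W_n + \hbar\omega)} \qquad (3.4.13)$$

where $\int d\tau$ stands for $\int d\mathbf{r}_1\,d\mathbf{r}_2 \cdots d\mathbf{r}_n$. Note that we have calculated the
driven part of $\psi_1$. One can always add to $\psi_1$ a solution of the homogeneous
equation, $c_n = c_n(0)\,e^{-iW_nt/\hbar}$, with $c_n(0)$ completely arbitrary. However,
in the real world such an added term with n ≠ 0 would damp out rapidly
by radiation. An added term with n = 0 is simply a change of normalization
that must be canceled in the integral (3.4.4).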
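Continuing from (3.4.13), we calculate the electric dipole moment:

$$\langle\mathbf{X}\rangle = \int(\psi_0^* + \psi_1^*)\,\mathbf{X}\,(\psi_0 + \psi_1)\,d\tau. \qquad (3.4.14)$$

The zeroth-order term is zero, since reflection invariance of $H_0$ requires
a nondegenerate $u_0$ to be either even or odd under the transformation
r → −r. The first-order term is

$$\langle\mathbf{X}\rangle = \int\psi_0^*\,\mathbf{X}\,\psi_1\,d\tau + \text{complex conjugate} \qquad (3.4.15)$$

or

$$\langle\mathbf{X}\rangle = -\int\frac{d\omega}{\sqrt{2\pi}}\,e^{-i\omega t}\sum_n\frac{\mathbf{X}_{0n}\,\mathbf{X}_{n0}\cdot\tilde{\mathbf{E}}(\omega)}{W_0 - W_n + \hbar\omega} + \text{complex conjugate} \qquad (3.4.16)$$

where $\mathbf{X}_{n0}$ is the electric dipole matrix element from $u_0$ to $u_n$:

$$\mathbf{X}_{n0} = \int u_n^*\,\mathbf{X}\,u_0\,d\tau. \qquad (3.4.17)$$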


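We explicitly add the complex conjugate to obtain

$$\langle\mathbf{X}\rangle = -\int_{-\infty}^{\infty}\frac{d\omega}{\sqrt{2\pi}}\,e^{-i\omega t}\sum_n\frac{\mathbf{X}_{0n}\,\mathbf{X}_{n0}\cdot\tilde{\mathbf{E}}(\omega)}{W_0 - W_n + \hbar\omega} - \int_{-\infty}^{\infty}\frac{d\omega}{\sqrt{2\pi}}\,e^{i\omega t}\sum_n\frac{\mathbf{X}^*_{0n}\,\mathbf{X}^*_{n0}\cdot\tilde{\mathbf{E}}^*(\omega)}{W_0 - W_n + \hbar\omega}. \qquad (3.4.18)$$

Next, change ω to −ω in the second term of (3.4.18):

$$\langle X_i\rangle = -\int_{-\infty}^{\infty}\frac{d\omega}{\sqrt{2\pi}}\,e^{-i\omega t}\,\tilde{E}_j(\omega)\sum_n\left\{\frac{X_{i0n}X_{jn0}}{W_0 - W_n + \hbar\omega} + \frac{X_{in0}X_{j0n}}{W_0 - W_n - \hbar\omega}\right\} \qquad (3.4.19)$$

where we have used the hermiticity of the operator X, that is,
$\mathbf{X}^*_{0n} = \mathbf{X}_{n0}$, and the reflection property of $\tilde{\mathbf{E}}$, that is, $\tilde{\mathbf{E}}^*(-\omega) = \tilde{\mathbf{E}}(\omega)$. We
then have a final formula for polarizability by an applied field:

$$\alpha_{ij}(\omega) = -\sum_n\left\{\frac{X_{i0n}X_{jn0}}{W_0 - W_n + \hbar\omega} + \frac{X_{in0}X_{j0n}}{W_0 - W_n - \hbar\omega}\right\}. \qquad (3.4.20)$$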


There are some important properties of $\alpha_{ij}$ to be noted. First, the
numerators for each n are Hermitian matrices in i, j space. That is,

$$(X_{i0n}X_{jn0})^{\dagger} \equiv (X_{j0n}X_{in0})^* = X_{i0n}X_{jn0}. \qquad (3.4.21)$$

Therefore, any anti-Hermitian part must come in some way from the
denominators.

Second, the numerator matrices in (3.4.20) are nonnegative. That is,
since $(X_i)_{0n}(X_j)_{n0} = (X_i)_{0n}(X_j)^*_{0n}$, any vector $V_i$ will make

$$V_i^*\,X_{i0n}X_{jn0}\,V_j = V_i^*\,X_{i0n}X^*_{j0n}\,V_j = |V_i^*X_{i0n}|^2 \ge 0.$$


Third, if the Hamiltonian $H_0$ is real (not only self-adjoint, but real),
then the wave functions $u_n$ can also be chosen to be real, and $X_{i0n} = X_{in0}$,
so that $\alpha_{ij}$ is automatically symmetric. The Hamiltonian $H_0$ can always be
made real if time reversal holds. Thus, time reversal invariance produces a
symmetric $\alpha_{ij}$.⁴ Note that a fixed magnetic field $\mathbf{B}_0$ will violate time reversal
invariance since the v × B/c force depends on the sign of the velocity,
which reverses under time reversal. (See Problem 3.5.)

Fourth, when ħω is equal to $W_n - W_0$ for some n, the first term of
(3.4.20) for $\alpha_{ij}(\omega)$ becomes infinite. This comes about because we have
ignored the fact that the excited atoms can radiate. Classically, this
radiation limits the amplitude of oscillation that can be produced by the
applied field and therefore keeps $\langle X_i\rangle$ finite. (See Problem 4.8 and Section
5.9.) Quantum mechanically, the possibility of radiation gives the energy
level $W_n$ a width, which appears as a negative imaginary part

$$W_n \to W_n - \frac{i\,\Gamma_n}{2} \qquad (3.4.22)$$

and prevents the pole from appearing at real values of ω, or, in this
approximation, anywhere in the upper half-plane.

Fifth, the polarizability $\alpha_{ij}(\omega) \to 0$ as ω → ∞, provided the sum
$\sum_n X_{0n}X_{n0}$ converges. This is presumably a general property of material
systems, where the convergence of the sum reflects the inability of matter
to follow an infinitely rapid oscillation. It then follows that the dielectric
tensor approaches $\delta_{jk}$: $\lim_{\omega\to\infty}\epsilon_{jk}(\omega) = \delta_{jk}$.
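
The structure of (3.4.20) can be explored numerically for a toy level scheme. In the sketch below the excitation energies, the (real, one-component) dipole matrix elements, and the choice of units with ħ = 1 are hypothetical values invented for illustration; no level width is included, so the function should only be evaluated away from the resonances.

```python
HBAR = 1.0  # units with hbar = 1 (assumption of this sketch)

# Hypothetical levels: (W_n - W_0, X_0n), with real one-component matrix elements.
LEVELS = [(1.0, 0.6), (1.5, 0.3), (2.2, 0.1)]

def alpha(omega):
    """Scalar version of (3.4.20) for real matrix elements (X_0n = X_n0)."""
    total = 0.0
    for delta, x in LEVELS:
        # W_0 - W_n = -delta in both denominators of (3.4.20).
        total -= x * x * (1.0 / (-delta + HBAR * omega) + 1.0 / (-delta - HBAR * omega))
    return total

static_limit = sum(2.0 * x * x / delta for delta, x in LEVELS)
falloff_coeff = sum(2.0 * delta * x * x for delta, x in LEVELS)

print("alpha(0):", alpha(0.0), "expected:", static_limit)
print("alpha(1000) * 1000^2:", alpha(1000.0) * 1000.0**2, "expected:", -falloff_coeff)
```

In this toy case the static polarizability is positive, α is an even function of ω, and at large ω it falls off like −2 Σ (W_n − W_0) X_0n² / (ħω)², which illustrates the fifth property for a finite set of levels.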


⁴Time reversal invariance is discussed briefly in Section 6.1.
[Figure 3.1. Original contour C and deformed contour C′ in the complex ω plane; the point ω₀ is marked in each panel.]


Finally, we turn to the question of absorption. Recall that $\alpha_{ij}(\omega)$
will be causal if

$$\langle X_i\rangle = \int_{-\infty}^{\infty}\frac{d\omega}{\sqrt{2\pi}}\,e^{-i\omega t}\,\alpha_{ij}(\omega)\,\tilde{E}_j(\omega) \qquad (3.4.23)$$

vanishes for t < 0, provided E(t) vanishes for t < 0, that is, provided
$\tilde{\mathbf{E}}(\omega)$ is analytic in the upper half-plane. For that to be the case, we must
for t < 0 be able to close the contour in (3.4.23) in the upper half ω plane.
We will be able to do so [since $\alpha_{ij}(\omega)$ is an analytic function of ω, except
on the real axis] if we define the integral (3.4.23) to be calculated with ω
slightly above the real axis. This prescription guarantees that our causal
condition will hold. At the same time, it gives a finite definition to the
integral (3.4.23), as well as to the sum (3.4.20) for $\alpha_{ij}$ in the case where
$W_0 + \hbar\omega$ is in a continuum region of the $W_n$'s.
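Consider as an example the integral

$$f(t) = \int_C d\omega\, e^{-i\omega t}\,\frac{g(\omega)}{\omega_0 - \omega} \qquad (3.4.24)$$

where $\omega_0$ is real, g(ω) is analytic in the upper half-plane and on the real
axis, and the contour C is above the real axis. Then for t < 0, we can
close the contour in the upper half ω plane and find f(t) = 0. For t > 0,
we proceed by deforming the contour (as shown in Figure 3.1) onto the
real axis, except for a semicircle of radius γ, which we will eventually let
shrink to zero. f(t) is then given by

$$f(t) = \lim_{\gamma\to0}\left[\int_{-\infty}^{\omega_0-\gamma} d\omega\, e^{-i\omega t}\,\frac{g(\omega)}{\omega_0 - \omega} + \int_{\omega_0+\gamma}^{\infty} d\omega\, e^{-i\omega t}\,\frac{g(\omega)}{\omega_0 - \omega} + \int_{C'} d\omega\, e^{-i\omega t}\,\frac{g(\omega)}{\omega_0 - \omega}\right]. \qquad (3.4.25)$$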
The first two integrals in (3.4.25) define the Cauchy principal value
P. In the C′ integral, since g is analytic on the real axis, we can expand
about $\omega_0$, with

$$\omega = \omega_0 + \gamma e^{i\theta} \qquad (3.4.26)$$

and keep the only term that fails to vanish as γ → 0:

$$\lim_{\gamma\to0}\int_{C'} d\omega\, e^{-i\omega t}\,\frac{g(\omega)}{\omega_0 - \omega} = \int_{\pi}^{0}\frac{i\gamma e^{i\theta}\,d\theta}{-\gamma e^{i\theta}}\,e^{-i\omega_0 t}\,g(\omega_0) = i\pi\,e^{-i\omega_0 t}\,g(\omega_0) \qquad (3.4.27)$$

so that

$$f(t) = P\int d\omega\,\frac{e^{-i\omega t}}{\omega_0 - \omega}\,g(\omega) + \pi i\,e^{-i\omega_0 t}\,g(\omega_0). \qquad (3.4.28)$$

A suitable mnemonic for $1/(\omega_0 - \omega)$ is thus

$$\lim_{\gamma\to0}\frac{1}{\omega_0 - \omega - i\gamma} = P\,\frac{1}{\omega_0 - \omega} + i\pi\,\delta(\omega_0 - \omega) \qquad (3.4.29)$$

where the first term is real, the second imaginary.
Note that if the Fourier transform of a differential equation (in our
case, the Schrödinger equation) leads to an algebraic equation

$$(\omega_0 - \omega)f = g, \qquad (3.4.30)$$

then the solution f can contain a term $A\,\delta(\omega_0 - \omega)$ with arbitrary A, since
xδ(x) = 0. The result for f(t), (3.4.28), shows that causality requires the
coefficient of $\delta(\omega_0 - \omega)$ to be $i\pi g(\omega_0)$.
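
The delta-function term of the mnemonic (3.4.29) can be checked numerically by keeping γ small but finite. In the sketch below the Gaussian test function, the values of ω₀ and γ, and the integration grid are assumptions made for this example only.

```python
import math

def g(omega):
    """Smooth test function (an arbitrary choice for this check)."""
    return math.exp(-omega * omega)

def integral_with_shifted_pole(omega0, gamma, half_width=12.0, step=2e-4):
    """Midpoint-rule value of the integral of g(w) / (omega0 - w - i gamma) dw."""
    n = int(round(2.0 * half_width / step))
    total = 0j
    for i in range(n):
        w = -half_width + (i + 0.5) * step
        total += g(w) / (omega0 - w - 1j * gamma)
    return total * step

omega0, gamma = 0.3, 1e-3
value = integral_with_shifted_pole(omega0, gamma)
print("imaginary part:", value.imag)
print("pi * g(omega0):", math.pi * g(omega0))
```

The imaginary part agrees with π g(ω₀) up to corrections of relative order γ. The real part approximates the principal-value integral, which this sketch does not test separately.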

Returning to (3.4.20), we see that for ω > 0, $\alpha_{jk}$ has an anti-Hermitian
part

$$\left(\frac{\alpha - \alpha^{\dagger}}{2i}\right)_{jk} = \pi\sum_n X_{j0n}X_{kn0}\,\delta(W_n - W_0 - \hbar\omega), \qquad (3.4.31)$$

corresponding to absorption of energy from the external field by the
atomic target.
3.5. FREE FIELDS IN ISOTROPIC MATERIALS


We look here for monochromatic free field solutions of Maxwell’s
equations in isotropic materials with scalar dielectric constant ε(ω),
permeability μ(ω), and conductivity σ(ω). (We will take up anisotropic
media in Section 3.7.) The equations are, with all quantities depending
on time like $e^{-i\omega t}$,

$$\nabla\times\mathbf{H} = \frac{1}{c}\left(-i\omega\mathbf{D} + 4\pi\mathbf{j}\right) \qquad (3.5.1)$$

and

$$\nabla\times\mathbf{E} = i\,\frac{\omega}{c}\,\mathbf{B}. \qquad (3.5.2)$$

The constraint (divergence) equations are automatically satisfied for
ω ≠ 0.
Set

$$\mathbf{j} = \sigma\mathbf{E}, \quad \mathbf{D} = \epsilon\mathbf{E} \quad\text{and}\quad \mathbf{H} = \frac{\mathbf{B}}{\mu}.$$

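Equation (3.5.1) becomes

$$\nabla\times\frac{\mathbf{B}}{\mu} = \frac{1}{c}\left(-i\omega\epsilon + 4\pi\sigma\right)\mathbf{E} = -i\,\frac{\omega}{c}\left(\epsilon + \frac{4\pi i\sigma}{\omega}\right)\mathbf{E}. \qquad (3.5.3)$$

We see that conductivity is equivalent to an imaginary part of ε, with a
pole at ω = 0 representing finite static conductivity. We may therefore
assume that 4πiσ/ω is included in ε, giving the simpler equation

$$\nabla\times\frac{\mathbf{B}}{\mu} = -i\,\frac{\omega}{c}\,\epsilon\,\mathbf{E}. \qquad (3.5.4)$$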


We can eliminate either E or B, and find propagation equations for
E and B alone. Thus, from (3.5.2), B = (c/iω)∇×E, substituted in
(3.5.4), gives

$$\nabla\times\left(\frac{\nabla\times\mathbf{E}}{\mu}\right) = \frac{\omega^2}{c^2}\,\epsilon\,\mathbf{E}, \qquad (3.5.5)$$

whereas substituting E from (3.5.4) into (3.5.2) yields

$$\nabla\times\left(\frac{1}{\epsilon}\,\nabla\times\frac{\mathbf{B}}{\mu}\right) = \frac{\omega^2}{c^2}\,\mathbf{B}. \qquad (3.5.6)$$

Equations (3.5.5) and (3.5.6) are equivalent, in that if E is a solution of
(3.5.5), then B = const. ∇×E satisfies (3.5.6); if B satisfies (3.5.6), then
E = const. (1/ε)∇×(B/μ) satisfies (3.5.5). Thus, either equation
describes the propagation of a monochromatic electromagnetic signal in
a linear isotropic material medium.

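Of course, there are many solutions for a given ε, μ, and ω; the choice
between them requires a specification of spatial boundary conditions. We
will return to this point later. For now we confine ourselves to
homogeneous media, so that ε and μ are spatial constants. In that case, (3.5.5) and
(3.5.6) become

$$-\nabla^2\mathbf{E} = \frac{\omega^2}{c^2}\,\epsilon\mu\,\mathbf{E} \qquad (3.5.7)$$

with

$$\nabla\cdot\mathbf{E} = 0$$

and

$$-\nabla^2\mathbf{B} = \frac{\omega^2}{c^2}\,\epsilon\mu\,\mathbf{B} \qquad (3.5.8)$$

with

$$\nabla\cdot\mathbf{B} = 0$$

and

$$\mathbf{B} = \frac{c}{i\omega}\,\nabla\times\mathbf{E}, \qquad \mathbf{E} = -\frac{c}{i\omega}\,\frac{1}{\epsilon\mu}\,\nabla\times\mathbf{B}. \qquad (3.5.9)$$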

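The solutions of (3.5.9) with definite wave number k are, with
$\omega^2 = c^2k^2/\epsilon\mu$,

$$\mathbf{E} = \mathbf{e}\,e^{i\mathbf{k}\cdot\mathbf{x}} \qquad (3.5.10)$$

and

$$\mathbf{B} = \mathbf{b}\,e^{i\mathbf{k}\cdot\mathbf{x}} \qquad (3.5.11)$$

with

$$\mathbf{b} = \frac{c}{\omega}\,\mathbf{k}\times\mathbf{e} \qquad (3.5.12)$$

and

$$\mathbf{e} = -\frac{c}{\epsilon\mu\omega}\,\mathbf{k}\times\mathbf{b}. \qquad (3.5.13)$$

Thus, the solution, for given k and ω, with $c^2k^2/\epsilon\mu = \omega^2$, is
determined by the vector e, which must be transverse (e·k = 0). For each
k and ω, there are two linearly independent directions of polarization,
$\mathbf{e}_\lambda$. For example, with k in the z direction, these modes could be $\mathbf{e}_1 = \hat{\mathbf{e}}_x$
and $\mathbf{e}_2 = \hat{\mathbf{e}}_y$.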
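
The two amplitude relations (3.5.12) and (3.5.13) are consistent only on the dispersion relation ω² = c²k²/εμ. The following sketch verifies this for one transverse mode. The numerical values of ε, μ, and k, the units with c = 1, and the helper names are assumptions chosen for illustration.

```python
import math

def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def b_from_e(k, e, omega, c=1.0):
    """Relation (3.5.12): b = (c / omega) k x e."""
    return tuple(c / omega * x for x in cross(k, e))

def e_from_b(k, b, omega, eps, mu, c=1.0):
    """Relation (3.5.13): e = -(c / (eps mu omega)) k x b."""
    return tuple(-c / (eps * mu * omega) * x for x in cross(k, b))

# Assumed medium and wave vector (units with c = 1).
EPS, MU = 2.25, 1.0
k = (0.0, 0.0, 3.0)
omega = math.sqrt(sum(x * x for x in k) / (EPS * MU))  # dispersion relation

e = (1.0, 0.0, 0.0)                      # transverse polarization, e . k = 0
b = b_from_e(k, e, omega)
e_back = e_from_b(k, b, omega, EPS, MU)  # should reproduce e
print("b =", b)
print("e recovered =", e_back)
```

With μ = 1 the magnetic amplitude is larger than the electric amplitude by the factor √ε, and applying (3.5.13) to b returns the original e.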

The most general complex vector $\mathbf{e}_\lambda$ represents a state of elliptic
polarization. Recall that the real fields $\mathbf{E}_r$ and $\mathbf{B}_r$ are calculated by
taking the real part of the complex vectors E and B. With $\mathbf{e}_\lambda = \mathbf{e}_1 + i\mathbf{e}_2$,
orthogonal to k, we have, at any one point x, with φ = k·x,

$$\mathbf{E}_r = \mathrm{Re}\,(\mathbf{e}_1 + i\mathbf{e}_2)\,e^{-i(\omega t - \phi)} \quad\text{or}\quad \mathbf{E}_r = \mathbf{e}_1\cos(\omega t - \phi) + \mathbf{e}_2\sin(\omega t - \phi) \qquad (3.5.14)$$

which describes an ellipse in the $\mathbf{e}_1$, $\mathbf{e}_2$-plane. There are two special cases.
First, $\mathbf{e}_1$ is parallel to $\mathbf{e}_2$, in which case $\mathbf{E}_r$ varies without changing direction.
This is called plane, or linearly polarized. Second, $\mathbf{e}_1$ is equal in magnitude and
perpendicular to $\mathbf{e}_2$. This is circularly polarized: $\mathbf{E}_r$ moves along a circle
of radius $|\mathbf{e}_1| = |\mathbf{e}_2|$.

The handedness of the polarization is defined by the screw sense of
the rotation of the electric field with respect to the direction of
propagation. Let the direction of propagation be z, with x, y, z being a
right-handed system, and $\mathbf{e} = \hat{\mathbf{e}}_x + i\hat{\mathbf{e}}_y$ so that $\mathbf{E}_r = \hat{\mathbf{e}}_x\cos\omega t + \hat{\mathbf{e}}_y\sin\omega t$ moves
like a right-handed screw. Thus, $\mathbf{e}_{\pm} = \hat{\mathbf{e}}_x \pm i\hat{\mathbf{e}}_y$ is right/left circular
polarized.⁵
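
The sense of rotation described here can be checked by evaluating the real field at two nearby times. In the sketch below the field is sampled at the origin (φ = 0), and the sign of the z component of E(0) × E(dt) is used as a measure of the rotation sense; the function names and the time step are assumptions of this illustration.

```python
import cmath
import math

def real_field(e, omega, t):
    """Real field Re(e exp(-i omega t)) at the origin, for a complex 3-vector e."""
    phase = cmath.exp(-1j * omega * t)
    return tuple((component * phase).real for component in e)

E_PLUS = (1 + 0j, 1j, 0j)    # e_x + i e_y
E_MINUS = (1 + 0j, -1j, 0j)  # e_x - i e_y

def rotation_sense(e, omega=1.0, dt=1e-3):
    """z component of E(0) x E(dt); positive means rotation from x toward y."""
    a = real_field(e, omega, 0.0)
    b = real_field(e, omega, dt)
    return a[0] * b[1] - a[1] * b[0]

def magnitude(e, omega, t):
    """Length of the real field vector at time t."""
    return math.sqrt(sum(x * x for x in real_field(e, omega, t)))

print("e_x + i e_y:", rotation_sense(E_PLUS))
print("e_x - i e_y:", rotation_sense(E_MINUS))
print("|E_r| at two times:", magnitude(E_PLUS, 1.0, 0.2), magnitude(E_PLUS, 1.0, 1.7))
```

For $\hat{\mathbf{e}}_x + i\hat{\mathbf{e}}_y$ the field turns from x toward y about the propagation direction z, the opposite sign gives the opposite sense, and in both cases the magnitude of the real field stays constant, as expected for circular polarization.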

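The most general solutions of the propagation equations in the
isotropic medium are, with $\mathbf{e}_{\mathbf{k}\lambda}$ chosen once and for all for each k,

$$\mathbf{E}(\mathbf{r}, t) = \int d\omega\int_{k^2 = \epsilon\mu\omega^2/c^2} d\mathbf{k}\,\sum_\lambda \mathbf{e}_\lambda\,e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}\,a(\mathbf{k}, \omega, \lambda) \qquad (3.5.15)$$

and

$$\mathbf{B}(\mathbf{r}, t) = \int d\omega\int_{k^2 = \epsilon\mu\omega^2/c^2} d\mathbf{k}\;e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}\,\frac{c}{\omega}\sum_\lambda \mathbf{k}\times\mathbf{e}_\lambda\,a(\mathbf{k}, \omega, \lambda). \qquad (3.5.16)$$

⁵There seems to be some disagreement in the literature on the definition of left and
right circular polarization. Our choice makes right circular polarization coincide with positive
photon helicity. See Section 3.8 for a discussion of helicity.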


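Before continuing, we distinguish between two situations. First, a
signal that lasts for a finite time is described by a function f(t) with a
Fourier transform $\tilde{f}(\omega) = \frac{1}{\sqrt{2\pi}}\int f(t)\,e^{i\omega t}\,dt$, such that integrated fluxes,
like $\int f(t)\,g(t)\,dt$, will be given by

$$\int_{-\infty}^{\infty} f(t)\,g(t)\,dt = \int_{-\infty}^{\infty}\tilde{f}^*(\omega)\,\tilde{g}(\omega)\,d\omega. \qquad (3.5.17)$$

Second is the case of a monochromatic signal, or a sum of
monochromatic signals for which

$$f(t) = \mathrm{Re}\left(f_0\,e^{-i\omega t}\right)$$

and

$$g(t) = \mathrm{Re}\left(g_0\,e^{-i\omega t}\right);$$

the time-integrated flux will, of course, be infinite; the item of interest
will usually be the time-averaged flux (or energy density). From

$$f = (\mathrm{Re}\,f_0)\cos\omega t + (\mathrm{Im}\,f_0)\sin\omega t$$

$$g = (\mathrm{Re}\,g_0)\cos\omega t + (\mathrm{Im}\,g_0)\sin\omega t \qquad (3.5.18)$$

we have

$$\overline{fg} = \frac{1}{2}\left[\mathrm{Re}\,f_0\,\mathrm{Re}\,g_0 + \mathrm{Im}\,f_0\,\mathrm{Im}\,g_0\right] = \frac{1}{2}\,\mathrm{Re}\left(f_0^*\,g_0\right). \qquad (3.5.19)$$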
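
The averaging rule (3.5.19) is easy to confirm numerically. In the sketch below the complex amplitudes, the frequency, and the number of sample points are arbitrary illustrative choices.

```python
import cmath
import math

def time_average_product(f0, g0, omega=1.0, samples=1000):
    """Average of Re(f0 e^{-i w t}) * Re(g0 e^{-i w t}) over one full period."""
    period = 2.0 * math.pi / omega
    total = 0.0
    for n in range(samples):
        t = period * n / samples
        phase = cmath.exp(-1j * omega * t)
        total += (f0 * phase).real * (g0 * phase).real
    return total / samples

def half_re(f0, g0):
    """Right-hand side of (3.5.19): (1/2) Re(f0* g0)."""
    return 0.5 * (f0.conjugate() * g0).real

f0 = 1.0 + 2.0j
g0 = 0.5 - 1.0j
print(time_average_product(f0, g0), half_re(f0, g0))
```

Uniform sampling over a full period averages the cos 2ωt and sin 2ωt terms to zero, so the two numbers agree to rounding error. Note that the complex conjugate on one factor matters: Re(f₀g₀) would give a different and incorrect answer.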


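We can now calculate the average energy density and energy flux of
a polarized monochromatic plane wave (with real ε and μ):

$$\mathbf{E}_r = \mathrm{Re}\;\mathbf{e}_\lambda\,e^{i(\mathbf{k}\cdot\mathbf{x} - \omega t)} \qquad (3.5.20)$$

$$\mathbf{B}_r = \mathrm{Re}\;\frac{c}{\omega}\,\mathbf{k}\times\mathbf{e}_\lambda\,e^{i(\mathbf{k}\cdot\mathbf{x} - \omega t)}. \qquad (3.5.21)$$

The energy density is, from (3.3.13),

$$\mathcal{U} = \frac{\epsilon E_r^2}{8\pi} + \frac{B_r^2}{8\pi\mu}$$

whose time average, from (3.5.19), is

$$\bar{\mathcal{U}} = \frac{1}{16\pi}\left\{\epsilon\,\mathbf{e}_\lambda^*\cdot\mathbf{e}_\lambda + \frac{c^2k^2}{\omega^2\mu}\,\mathbf{e}_\lambda^*\cdot\mathbf{e}_\lambda\right\} \qquad (3.5.22)$$

$$= \frac{\epsilon}{8\pi}\,\mathbf{e}_\lambda^*\cdot\mathbf{e}_\lambda. \qquad (3.5.23)$$

Note that the magnetic and electric contributions to $\bar{\mathcal{U}}$ are equal.
The time-averaged flux of energy is, from (3.3.14),

$$\bar{\boldsymbol{\mathcal{S}}} = \frac{c}{4\pi}\,\frac{1}{2}\,\mathrm{Re}\left[\mathbf{e}_\lambda^*\times\frac{c}{\omega\mu}\,(\mathbf{k}\times\mathbf{e}_\lambda)\right] = \frac{c^2\mathbf{k}}{8\pi\omega\mu}\,\mathbf{e}_\lambda^*\cdot\mathbf{e}_\lambda = \bar{\mathcal{U}}\,\frac{c}{\sqrt{\epsilon\mu}}\,\hat{\mathbf{k}} \qquad (3.5.24)$$

corresponding to a velocity $c/\sqrt{\epsilon\mu}$.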
How do we take into account the imaginary part of εμ in the equation

$$\omega^2 = \frac{c^2k^2}{\epsilon\mu}\,? \qquad (3.5.25)$$

We can understand most clearly what happens here by observing that for
$\sqrt{\epsilon\mu}$ complex, either k or ω (or both) must be complex. Therefore, our
considerations cannot apply to a medium occupying all of space for all
time, since complex ω implies an exponentially growing field in time
(either future or past), complex k an exponentially growing field in space.
Imposing consistent initial conditions requires that the dielectric be of 
limited spatial extent (including possibly semi-infinite). The incoming field 
may then be specified in free space outside the dielectric for all previous
times, with the field in the dielectric zero for all previous times. 

We shall see next how a sensible formulation of boundary conditions 
and time development presents itself naturally with a semi-infinite 
medium. For this purpose, we consider the simplest possible situation: a 
semi-infinite dielectric to the right of x = 0, and a plane polarized electro- 
magnetic wave incident normally from the left. 

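The incident electric field is $\mathbf{E} = \hat{\mathbf{e}}_y E_{inc}$ with

$$E_{inc} = \int_{-\infty}^{\infty} d\omega\, f(\omega)\,e^{i\omega[(x/c) - t - (b/c)]} \qquad (3.5.26)$$

where b < 0.

We chose f(ω) to be analytic in the upper half ω plane. This ensures
that E = 0 for (x/c) − t − (b/c) > 0; in particular, at t = 0, E and ∂E/∂t
vanish for x > b. It also makes E independent of the path of the ω integral
in the upper half-plane, provided f(ω) → 0 sufficiently rapidly as ω → ∞.

The incident magnetic field is $\mathbf{B} = \hat{\mathbf{e}}_z B_{inc}$ with

$$B_{inc} = \int_{-\infty}^{\infty} d\omega\, f(\omega)\,e^{i\omega[(x/c) - t - (b/c)]}. \qquad (3.5.27)$$

Maxwell’s equations become, for this simple geometry,

$$\frac{\partial E}{\partial x} = -\frac{1}{c}\frac{\partial B}{\partial t} \qquad (3.5.28)$$

and

$$\frac{\partial B}{\partial x} = -\frac{1}{c}\frac{\partial D}{\partial t} \qquad (3.5.29)$$

where

$$\mathbf{D} = \hat{\mathbf{e}}_y D \quad\text{and}\quad \mathbf{B} = \mathbf{H}.$$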


The divergence equations are satisfied identically.
The boundary conditions at $x = 0$ follow directly from (3.5.28) and
(3.5.29). They are $E$ and $B$ continuous.

The appropriate solution of (3.5.28) and (3.5.29), with $\mu = 1$ and $\epsilon$
a function of $\omega$, is

$$E = \int d\omega\, e^{-i\omega[t + (b/c)]}\left(e^{i\omega x/c} - R(\omega)\, e^{-i\omega x/c}\right) f(\omega), \qquad x < 0 \tag{3.5.30}$$

and

$$E = \int d\omega\, e^{-i\omega[t + (b/c)]}\, T(\omega)\, e^{ik'x} f(\omega), \qquad x > 0 \tag{3.5.31}$$

where $R(\omega)$ is the reflected amplitude and $T(\omega)$ the transmitted amplitude.
The wave number in the dielectric is $k' = \omega\sqrt{\epsilon}/c$. In order for $E$
not to grow exponentially, we must choose $\mathrm{Im}\,k' > 0$.

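The magnetic field is given by

$$B = \int d\omega\, e^{-i\omega[t + (b/c)]}\left(e^{i\omega x/c} + R(\omega)\, e^{-i\omega x/c}\right) f(\omega), \qquad x < 0 \tag{3.5.32}$$

and

$$B = \int d\omega\, \sqrt{\epsilon}\; e^{-i\omega[t + (b/c)]}\, T(\omega)\, e^{ik'x} f(\omega), \qquad x > 0.$$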


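The boundary conditions at $x = 0$ are

$$1 - R = T \tag{3.5.33}$$

and

$$1 + R = \sqrt{\epsilon}\,T \tag{3.5.34}$$

so that

$$T = \frac{2}{1 + \sqrt{\epsilon}} \tag{3.5.35}$$

and

$$R = \frac{\sqrt{\epsilon} - 1}{\sqrt{\epsilon} + 1}. \tag{3.5.36}$$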


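This solution gives $E$ and $B$ the desired properties: for $t < 0$,

$$E = E_{\rm inc} \tag{3.5.37}$$

$$\frac{\partial E}{\partial t} = \frac{\partial E_{\rm inc}}{\partial t} \tag{3.5.38}$$

$$B = B_{\rm inc} \tag{3.5.39}$$

and

$$\frac{\partial B}{\partial t} = \frac{\partial B_{\rm inc}}{\partial t}. \tag{3.5.40}$$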


This follows from the known properties of $\epsilon(\omega)$: $\epsilon(\omega)$ is analytic and
nonzero in the upper half-plane, and $\sqrt{\epsilon} \to 1$ as $\omega \to \infty$ since we have
chosen $\mathrm{Im}\,k' > 0$. Therefore, $T$, $R$, and $k'$ are analytic in the upper
half-plane, $k' \to \omega/c$ as $\omega \to \infty$, and the reflected and transmitted waves vanish
for $t < 0$. Therefore, we have correctly incorporated the initial condition
$E = E_{\rm inc}$ and $B = B_{\rm inc}$ into our solution. Given these analytic properties,
one can also show that the wavefronts propagate causally, that is, that no
transmitted or reflected wave shows up before transmission at velocity $c$
would permit it to do so. (See Problems 3.13 and 3.14.)
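
As a quick numerical illustration of (3.5.33)-(3.5.36), the following minimal Python sketch evaluates $R$ and $T$ for a complex $\epsilon$ and checks that both boundary conditions are satisfied. The function name `normal_incidence` and the sample value of $\epsilon$ are assumptions made for illustration; the only physics used is the choice of the root of $\sqrt{\epsilon}$ with non-negative imaginary part, corresponding to $\mathrm{Im}\,k' > 0$ for positive real $\omega$.

```python
import cmath

def normal_incidence(eps):
    """Return (R, T, n) for normal incidence from vacuum on a medium with mu = 1.

    n = sqrt(eps) is taken on the branch with Im n >= 0, i.e. Im k' >= 0
    for positive real frequency, so the transmitted wave decays into the medium.
    """
    n = cmath.sqrt(eps)
    if n.imag < 0:
        n = -n
    T = 2 / (1 + n)          # (3.5.35)
    R = (n - 1) / (n + 1)    # (3.5.36)
    return R, T, n

# Illustrative value only: a slightly absorbing dielectric.
R, T, n = normal_incidence(2.25 + 0.1j)

# Residuals of the boundary conditions (3.5.33) and (3.5.34); both should be ~0.
res_E = (1 - R) - T
res_B = (1 + R) - n * T
print(abs(res_E), abs(res_B))
```

For real $\epsilon$ the same function can be used to confirm the energy balance $|R|^2 + \sqrt{\epsilon}\,|T|^2 = 1$.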


3.6. REFLECTION AND REFRACTION 


We consider now a plane wave of polarization $\mathbf{e}$, wave number $\mathbf{k}$, and
frequency $\omega = ck$ incident in the $x$-$y$ plane from the left in air (or
vacuum) on the plane surface of a dielectric medium with an index
of refraction $n = \sqrt{\epsilon}$ extending from $x = 0$ to the right. We take $\mu = 1$.
The angle of incidence is $\theta$, the angle of refraction $\theta'$. We take the
polarization (which we define as the $\mathbf{E}$ direction) in the plane of incidence.
This is illustrated in Figure 3.2. (See also Problem 3.7.)

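The boundary conditions at $x = 0$ must hold for all $y$ and $z$, so that
$k_y$ and $k_z$ must be continuous across the boundary (taken here at $x = 0$).
Thus, with $\omega^2 = c^2k^2 = c^2k'^2/\epsilon$, we have $k'^2 = k^2\epsilon$ or, since $k_z = 0$,

$$k_x'^2 + k_y^2 = (k_x^2 + k_y^2)\,\epsilon. \tag{3.6.1}$$

Consider first real $\epsilon$. Then we have $k_y^2 = k^2\sin^2\theta$ and $k_y^2 = k'^2\sin^2\theta'$, so

$$\frac{\sin\theta}{\sin\theta'} = \frac{k'}{k} = \sqrt{\epsilon}, \tag{3.6.2}$$

which is Snell's law.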
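For complex $\epsilon$, we return to (3.6.1). We consider first the case of
high conductivity, $4\pi\sigma/\omega \gg 1$, for which

$$\epsilon \cong \frac{4\pi i\sigma}{\omega},$$

$$k_x'^2 \cong \frac{4\pi i\sigma}{\omega}\,k^2,$$

and

$$k_x' \cong \frac{1+i}{\sqrt{2}}\sqrt{\frac{4\pi\sigma}{\omega}}\;k \tag{3.6.3}$$

so that all fields in the conductor damp out like $e^{-\sqrt{2\pi\sigma\omega}\,x/c}$; beyond the
skin depth

$$\delta = \frac{1}{k}\sqrt{\frac{\omega}{2\pi\sigma}} = \frac{c}{\sqrt{2\pi\sigma\omega}} \tag{3.6.4}$$

the fields go rapidly to zero.

Figure 3.2.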


If $\epsilon$ has only a small imaginary part, the real part of $k_x'$ will be
determined, as usual, as $\mathrm{Re}\,k_x' = k'\cos\theta'$ with $\theta'$ given by Snell's law.
The imaginary part, $\mathrm{Im}\,k_x'$, will be given by the equation

$$(\mathrm{Re}\,k_x' + i\,\mathrm{Im}\,k_x')^2 + k_y^2 = k^2\epsilon \tag{3.6.5}$$

and

$$\mathrm{Im}\,k_x' = \frac{k^2\,\mathrm{Im}\,\epsilon}{2\,\mathrm{Re}\,k_x'} = \frac{k}{2\cos\theta'}\,\frac{\sin\theta'}{\sin\theta}\,\mathrm{Im}\,\epsilon, \tag{3.6.6}$$

neglecting $(\mathrm{Im}\,k_x')^2$. Thus to a good approximation (for small $\mathrm{Im}\,\epsilon$), the
transmitted wave behaves as if $\epsilon$ is real, except, as we move into the
medium, for a damping given by

$$|f(x, y, z)| = e^{-\mathrm{Im}\,k_x'\,x}\,|f(0, y, z)| \tag{3.6.7}$$

with $\mathrm{Im}\,k_x'$ given by (3.6.6). The direction of energy flow is given by the
real part of the wave number, $\mathrm{Re}\,\mathbf{k}' = \hat{\mathbf{e}}_x\,\mathrm{Re}\,k_x' + \hat{\mathbf{e}}_y k_y$.
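
The quality of the approximation (3.6.6) can be checked numerically. The sketch below compares the exact root of (3.6.1) with the small-$\mathrm{Im}\,\epsilon$ estimate; the function names and the sample values of $k$, $\theta$, and $\epsilon$ are illustrative assumptions, and Snell's law is applied with $\mathrm{Re}\,\epsilon$ only.

```python
import cmath
import math

def kx_exact(k, theta, eps):
    """Root of (3.6.1) for k_x' with Im k_x' >= 0 (decay into the medium)."""
    ky = k * math.sin(theta)
    kx = cmath.sqrt(eps * k * k - ky * ky)
    return kx if kx.imag >= 0 else -kx

def im_kx_approx(k, theta, eps):
    """Im k_x' from (3.6.6), with theta' from Snell's law using Re(eps)."""
    n = math.sqrt(eps.real)
    cos_tp = math.sqrt(1.0 - (math.sin(theta) / n) ** 2)
    # sin(theta')/sin(theta) = 1/n by Snell's law, so (3.6.6) reads:
    return k * eps.imag / (2.0 * n * cos_tp)

k, theta, eps = 1.0, 0.5, 2.25 + 0.01j   # illustrative values
exact = kx_exact(k, theta, eps).imag
approx = im_kx_approx(k, theta, eps)
rel_err = abs(exact - approx) / exact
print(exact, approx, rel_err)
```

The relative error is of order $(\mathrm{Im}\,k_x')^2/(\mathrm{Re}\,k_x')^2$, in line with the term neglected in going from (3.6.5) to (3.6.6).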

In going from (3.6.5) to (3.6.6), we have chosen the positive root of
(3.6.5) for $k_x'$. For a medium extending to $x \to \infty$, this choice is evidently
required and, as we have seen in the previous section, is consistent with
the causality requirement that $\sqrt{\epsilon(\omega)} \to 1$ as $\omega \to \infty$. If the medium
extends only a finite distance in the $x$-direction, there will be a second set
of boundary conditions at the second surface. In order to ensure that our
solution corresponds to the physical input (incident wave on left, reflected
wave on left, transmitted wave on right with no incident wave on right),
we will have to use both roots of (3.6.5) in our solution. However, if the
wave has substantially decayed by the time it hits the second boundary,
the numerical effect of this change is small. In the following we assume
that the second surface is far enough away to be neglected.

We return now to the first surface and construct the solution that 
satisfies the boundary conditions. 

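The electric field is given, for $x < 0$, by

$$\mathbf{E} = \mathbf{e}\, e^{i(k_x x + k_y y - \omega t)} - \mathbf{e}_R\, e^{i(-k_x x + k_y y - \omega t)} \tag{3.6.8}$$

(where $\mathbf{e}_R$ is the reflected amplitude) and, for $x > 0$, by

$$\mathbf{E} = \mathbf{e}_T\, e^{i(k_x' x + k_y y - \omega t)}. \tag{3.6.9}$$

The minus sign preceding $\mathbf{e}_R$ in (3.6.8) is chosen for convenience.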
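Thus, with

$$\mathbf{B} = \mathbf{b}\, e^{i(k_x x + k_y y - \omega t)} - \mathbf{b}_R\, e^{i(-k_x x + k_y y - \omega t)} \tag{3.6.10}$$

for $x < 0$, and

$$\mathbf{B} = \mathbf{b}_T\, e^{i(k_x' x + k_y y - \omega t)} \tag{3.6.11}$$

for $x > 0$, we must have

$$\mathbf{b} = \frac{c}{\omega}\,\mathbf{k}\times\mathbf{e}, \tag{3.6.12}$$

$$\mathbf{b}_R = \frac{c}{\omega}\,\mathbf{k}_R\times\mathbf{e}_R, \tag{3.6.13}$$

and

$$\mathbf{b}_T = \frac{c}{\omega}\,\mathbf{k}'\times\mathbf{e}_T. \tag{3.6.14}$$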


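The condition $\nabla\cdot\mathbf{E} = 0$ determines the vectors $\mathbf{e}$, $\mathbf{e}_R$, and $\mathbf{e}_T$ to be

$$e_x = -\sin\theta \tag{3.6.15}$$
$$e_y = \cos\theta \tag{3.6.16}$$
$$e_{Rx} = R\sin\theta \tag{3.6.17}$$
$$e_{Ry} = R\cos\theta \tag{3.6.18}$$
$$e_{Tx} = -T\sin\theta' \tag{3.6.19}$$

and

$$e_{Ty} = T\cos\theta' \tag{3.6.20}$$

where $R$ and $T$, the scalar reflection and transmission amplitudes, are to
be determined from the boundary conditions: $\Delta E_y = 0$ and $\Delta B_z = 0$. From
$\Delta E_y = 0$, we find

$$\cos\theta\,(1 - R) = \cos\theta'\,T, \tag{3.6.21}$$

and from $\Delta B_z = 0$, we find

$$\frac{ck}{\omega}\,(1 + R) = \frac{ck'}{\omega}\,T. \tag{3.6.22}$$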


The solution for $R$ and $T$ is

$$R = \frac{k'\cos\theta - k\cos\theta'}{k'\cos\theta + k\cos\theta'} \tag{3.6.23}$$

and

$$T = \frac{2k\cos\theta}{k'\cos\theta + k\cos\theta'}. \tag{3.6.24}$$

(See Problem 3.4 for a discussion of energy balance.)

There are two interesting limiting cases to consider. First, high
conductivity, where $k'$ will dominate in (3.6.23) and make $R \cong 1$. Second,
$\mathrm{Im}\,\epsilon = 0$. Then $k_x'$ and $k'$ are real, and (3.6.23) shows that $R$ can vanish.
This happens at $\theta_B$ (Brewster's angle) when

$$k'\cos\theta_B = k\cos\theta_B'$$

or, since $k' = \sqrt{\epsilon}\,k$,

$$\sin 2\theta_B = \sin 2\theta_B'. \tag{3.6.25}$$

Equation (3.6.25), together with Snell's law, has one root: $\theta + \theta' = \pi/2$.
Thus, at an angle of incidence $\theta_B$ such that the reflected and refracted
rays are orthogonal, there will be no reflection of an incident field
polarized in the plane of incidence.

One can understand this phenomenon by remembering that reflection
consists of radiation by dipole moments of the dielectric. A dipole
polarization with direction $\mathbf{e}'$ cannot radiate in its direction of polarization (as
we shall learn later). Since $\mathbf{e}'$ is perpendicular to $\mathbf{k}'$ and $\mathbf{k}_R$ is also
perpendicular to $\mathbf{k}'$, $\mathbf{k}_R$ is in the direction of $\mathbf{e}'$ and, hence, there is no reflection
at the Brewster angle. (See Figure 3.3.)
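
The vanishing of $R$ at the Brewster angle can be verified directly from (3.6.23) and (3.6.24). The following is a minimal sketch for real $\epsilon > 1$; the function name `fresnel_p` and the value $\epsilon = 2.25$ are assumptions chosen for illustration, and the wave numbers are expressed in units of $k$ so that $k'/k = \sqrt{\epsilon}$.

```python
import math

def fresnel_p(theta, eps):
    """R and T of (3.6.23)-(3.6.24) for E in the plane of incidence, real eps > 1."""
    n = math.sqrt(eps)                          # k'/k
    theta_p = math.asin(math.sin(theta) / n)    # Snell's law (3.6.2)
    kp_cos = n * math.cos(theta)                # k' cos(theta) in units of k
    k_cosp = math.cos(theta_p)                  # k cos(theta') in units of k
    R = (kp_cos - k_cosp) / (kp_cos + k_cosp)
    T = 2.0 * math.cos(theta) / (kp_cos + k_cosp)
    return R, T, theta_p

eps = 2.25                                      # illustrative value
# theta + theta' = pi/2 combined with Snell's law gives tan(theta_B) = sqrt(eps).
theta_B = math.atan(math.sqrt(eps))
R_B, T_B, theta_p_B = fresnel_p(theta_B, eps)
print(R_B, theta_B + theta_p_B - math.pi / 2)
```

At any other angle the same function returns a nonzero $R$, and the pair $(R, T)$ satisfies the boundary condition (3.6.21).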


Figure 3.3.


3.7. PROPAGATION IN ANISOTROPIC MEDIA


We now consider a tensor dielectric constant, which we assume to be real
and symmetric. (We could as easily take $\epsilon_{ij}$ to be Hermitian; however, an
anti-Hermitian component, i.e., absorption, is a significant complication
that we do not discuss. It would normally be treated perturbatively and/or
numerically.) We assume unit magnetic permeability. Note that a scalar
permeability would not be a significant complication, but a tensor
permeability would.
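The relevant equations are as always (for a fixed wave number)

$$\frac{\omega}{c}\,\mathbf{b} = \mathbf{k}\times\mathbf{e} \tag{3.7.1}$$

$$\mathbf{k}\times\mathbf{b} = -\frac{\omega}{c}\,\mathbf{d} \tag{3.7.2}$$

and

$$d_i = \epsilon_{ij}e_j. \tag{3.7.3}$$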


We see from (3.7.1)-(3.7.3) that there are two sets of right-handed
coordinate systems associated with these vectors: $\mathbf{k}$, $\mathbf{d}$, and $\mathbf{b}$ on the one
hand, and $\mathbf{e}$, $\mathbf{b}$, and $\mathbf{P} = (c/4\pi)\,\mathbf{e}\times\mathbf{b}$ on the other. Further, since $\mathbf{k}$, $\mathbf{e}$,
and $\mathbf{d}$ are all orthogonal to $\mathbf{b}$, they are coplanar. In general, $\mathbf{e}\times\mathbf{b}$ and $\mathbf{k}$
are not in the same direction. Therefore, the direction of energy flow and
that of $\mathbf{k}$ will be, in general, different. An exception that we shall see
below is the ordinary ray of a monoaxial crystal.

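The controlling equation for $\mathbf{e}$ is obtained by substituting (3.7.1) into
(3.7.2) to yield

$$[\mathbf{k}\times(\mathbf{k}\times\mathbf{e})]_i = -\frac{\omega^2}{c^2}\,\epsilon_{ij}e_j \quad\text{or}\quad k^2 e_i - \mathbf{k}\cdot\mathbf{e}\,k_i = \frac{\omega^2}{c^2}\,\epsilon_{ij}e_j. \tag{3.7.4}$$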


Equation (3.7.4) has an orthogonality property between the solutions
with different eigenvalues of $\omega^2$, say, $\mathbf{e}^\alpha$ with $\omega_\alpha^2$ and $\mathbf{e}^\beta$ with $\omega_\beta^2$, to wit

$$e_i^\alpha\,\epsilon_{ij}\,e_j^\beta = \mathbf{d}^\alpha\cdot\mathbf{e}^\beta = 0, \qquad \omega_\alpha \neq \omega_\beta. \tag{3.7.5}$$

This is derived in the usual way by multiplying (3.7.4) for $\mathbf{e}^\alpha$ by $\mathbf{e}^\beta$, (3.7.4)
for $\mathbf{e}^\beta$ by $\mathbf{e}^\alpha$, and subtracting. The left-hand side vanishes, leaving

$$\frac{(\omega_\alpha^2 - \omega_\beta^2)}{c^2}\,e_i^\alpha\,\epsilon_{ij}\,e_j^\beta = 0.$$

Note, however, that in refraction it is $\omega$ that is the same in both media,
not $\mathbf{k}$; therefore, this orthogonality does not hold between the two modes
of propagation for given $\omega$.

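To study (3.7.4), we choose coordinate axes $x_\alpha$ that diagonalize $\epsilon_{ij}$.
That is,

$$\sum_\beta \epsilon_{\alpha\beta}\,e_\beta = \epsilon_\alpha e_\alpha \quad\text{(no sum)} \tag{3.7.6}$$

where $\epsilon_\alpha$ are the eigenvalues of $\epsilon_{ij}$, considered here to be known functions
of $\omega$. In the $x_\alpha$ coordinate system, (3.7.4) becomes

$$\mathbf{k}\cdot\mathbf{e}\,k_\alpha - k^2 e_\alpha = -\frac{\omega^2}{c^2}\,\epsilon_\alpha e_\alpha \quad\text{(no sum)}. \tag{3.7.7}$$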


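We expect there to be, for each $\mathbf{k}$, three eigenvalues for $\omega^2$, for which
we can solve (3.7.7).

The first eigenvalue is universal and uninteresting: $e_\alpha = k_\alpha$, which gives
$\omega = 0$. There remain two eigenvalues. We first "solve" (3.7.7) for $e_\alpha$:

$$e_\alpha = \frac{\mathbf{k}\cdot\mathbf{e}\,k_\alpha}{k^2 - \dfrac{\omega^2}{c^2}\,\epsilon_\alpha} \tag{3.7.8}$$

and, multiplying by $k_\alpha$ and summing, we obtain

$$\mathbf{k}\cdot\mathbf{e} = \sum_\alpha k_\alpha e_\alpha = \mathbf{k}\cdot\mathbf{e}\,\sum_\alpha \frac{k_\alpha^2}{k^2 - \dfrac{\omega^2}{c^2}\,\epsilon_\alpha} \tag{3.7.9}$$

so that, for $\mathbf{k}\cdot\mathbf{e} \neq 0$,

$$\sum_\alpha \frac{k_\alpha^2}{k^2 - \dfrac{\omega^2}{c^2}\,\epsilon_\alpha} = 1 \tag{3.7.10}$$

and, with $1 = \sum_\alpha k_\alpha^2/k^2$,

$$\sum_\alpha k_\alpha^2\left[\frac{1}{k^2 - \dfrac{\omega^2}{c^2}\,\epsilon_\alpha} - \frac{1}{k^2}\right] = 0 \quad\text{or}\quad \sum_\alpha \frac{k_\alpha^2\,\epsilon_\alpha}{k^2 - \dfrac{\omega^2}{c^2}\,\epsilon_\alpha} = 0. \tag{3.7.11}$$

Multiplying (3.7.11) by

$$\prod_\alpha\left(k^2 - \frac{\omega^2}{c^2}\,\epsilon_\alpha\right)$$

gives a quadratic equation for $\omega^2$ with two roots. Note that if it is
necessary to take dispersion in $\epsilon_\alpha$ into account, so that $\epsilon_\alpha$ itself is a
function of $\omega$, the eigenvalues of $\omega$ are not obtained simply by solving
the quadratic equation (3.7.11). In the following we will assume that over
the width of the wave packet with which we are dealing the dispersion
can be ignored, that is, the $\epsilon_\alpha$ may be taken as given positive constants.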

Here, we again point out that we have solved the first of two obvious
problems. This is to find the propagating mode frequencies and
polarizations, given the wave number $\mathbf{k}$ in the medium. The converse is also
straightforward: Given a frequency $\omega$ and a direction of propagation $\hat{\mathbf{k}}$,
find the wave number and polarizations of the two modes. The second,
and harder, problem arises in analyzing refraction in a biaxial crystal.
There one is given the incident frequency $\omega$ and wave vector $\mathbf{k}$, and
trivially determines the two components of the transmitted wave number
that lie in the boundary between the media and, hence, are continuous
across the boundary. The problem is then to find the third component of
the wave number and polarization of the propagating modes. (See
Problem 3.11.)

We turn now to the simple example of a monoaxial crystal, that is,
one in which two of the eigenvalues, say, $\epsilon_1$ and $\epsilon_3$, are equal. Then any
pair of orthogonal axes in the 1, 3-plane are eigenvectors of the $\epsilon$ matrix,
and we can directly construct a solution of (3.7.1) and (3.7.2) by choosing
the 1-axis so that $\mathbf{k}$ lies in the 1, 2-plane, $\mathbf{e}$ along the 3-axis, and $\mathbf{b}$
perpendicular to $\mathbf{k}$ in the 1, 2-plane. Since $\mathbf{e}$ is an eigenvector of $\epsilon_{ij}$ with eigenvalue
$\epsilon_1$, $\mathbf{d}$ is also along the 3-axis, with value $\mathbf{d} = \epsilon_1\mathbf{e}$. This is illustrated in
Figure 3.4.


Figure 3.4.

Since $\mathbf{k}\cdot\mathbf{e} = 0$ for this mode, equation (3.7.4) yields

$$\omega_O^2 = \frac{c^2k^2}{\epsilon_1}, \tag{3.7.12}$$

where the subscript $O$ stands for ordinary. Further, the direction of energy
flow for this ray (the ordinary ray) is in the direction of $\mathbf{k}$, so that the
ordinary ray behaves as if it were propagating in an isotropic medium.

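For given $\mathbf{k}$, the second mode (the extraordinary ray) will have its $\mathbf{d}$
vector $\mathbf{d}_E$ perpendicular to $\mathbf{k}$ and to the $\mathbf{e}$ vector $\mathbf{e}_O$ of the ordinary ray.
Its frequency will be given by (3.7.11), that is,

$$\frac{k_1^2\,\epsilon_1}{k^2 - \dfrac{\omega^2}{c^2}\,\epsilon_1} + \frac{k_2^2\,\epsilon_2}{k^2 - \dfrac{\omega^2}{c^2}\,\epsilon_2} = 0 \quad\text{or}\quad \omega^2 = c^2\,\frac{k_1^2\epsilon_1 + k_2^2\epsilon_2}{\epsilon_1\epsilon_2}. \tag{3.7.13}$$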


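The vectorial nature of the extraordinary mode can be easily
constructed. With $\mathbf{k} = (k_1, k_2)$, we have $\mathbf{d}_E\cdot\mathbf{k} = 0$, or, in an obvious notation,
$\mathbf{e}_E = (e_1, e_2)$ is perpendicular to $\epsilon_{ij}k_j = (\epsilon_1k_1, \epsilon_2k_2)$. So,

$$\mathbf{e}_E = (-\epsilon_2k_2,\ \epsilon_1k_1)\times\text{constant}. \tag{3.7.14}$$

The direction of energy flow is

$$\mathbf{P} \propto \mathbf{e}\times\mathbf{b} \propto \mathbf{e}\times(\mathbf{k}\times\mathbf{e}) = e^2\,\mathbf{k} - \mathbf{e}\cdot\mathbf{k}\,\mathbf{e}$$

or

$$\mathbf{P} \propto (\epsilon_2^2k_2^2 + \epsilon_1^2k_1^2)(k_1, k_2) - (\epsilon_2k_2k_1 - \epsilon_1k_2k_1)(\epsilon_2k_2,\ -\epsilon_1k_1) = (\epsilon_1k_1^2 + \epsilon_2k_2^2)(\epsilon_1k_1,\ \epsilon_2k_2)$$

so that the direction of energy flow is

$$\hat{\mathbf{P}} = \frac{(\epsilon_1k_1,\ \epsilon_2k_2)}{(\epsilon_1^2k_1^2 + \epsilon_2^2k_2^2)^{1/2}}. \tag{3.7.15}$$

This is also the direction of the group velocity that follows from (3.7.13),
still neglecting dispersion:

$$\nabla_{\mathbf{k}}\,\omega = \frac{c^2}{\epsilon_1\epsilon_2\,\omega}\,(k_1\epsilon_1,\ k_2\epsilon_2). \tag{3.7.16}$$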

In general, an incident ray on a monoaxial crystal will require, at
given $\omega$, both modes to satisfy the boundary conditions. Since the
ordinary mode frequency wave number relation is independent of angle, the
refracted light will contain both modes, the ordinary ray refracting
according to Snell's law, with $\epsilon = \epsilon_1$, the extraordinary ray refracting with a
different angular dependence. (See Problem 3.11.)
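
The two modes of the monoaxial crystal can be checked by substituting them back into (3.7.4). The sketch below works in the principal-axis frame with $c = 1$; the helper name `residual` and the numerical values of $\epsilon_1$, $\epsilon_2$, and $\mathbf{k}$ are illustrative assumptions. It also checks that $\mathbf{e}\times(\mathbf{k}\times\mathbf{e})$ for the extraordinary mode is parallel to $(\epsilon_1k_1, \epsilon_2k_2)$, as stated in (3.7.15).

```python
def residual(k, e, eps, w2):
    """Components of k^2 e_i - (k.e) k_i - w^2 eps_i e_i, i.e. (3.7.4) with c = 1
    and a diagonal dielectric tensor eps = (eps_1, eps_2, eps_3)."""
    k_sq = sum(ki * ki for ki in k)
    k_dot_e = sum(ki * ei for ki, ei in zip(k, e))
    return [k_sq * ei - k_dot_e * ki - w2 * epsi * ei
            for ki, ei, epsi in zip(k, e, eps)]

eps1, eps2 = 2.0, 3.0              # illustrative principal values, eps_3 = eps_1
eps = (eps1, eps2, eps1)
k = (0.6, 0.8, 0.0)                # k in the 1, 2-plane
k1, k2 = k[0], k[1]

# Ordinary mode: e along the 3-axis, frequency from (3.7.12).
e_O = (0.0, 0.0, 1.0)
w2_O = (k1 ** 2 + k2 ** 2) / eps1
res_O = residual(k, e_O, eps, w2_O)

# Extraordinary mode: polarization (3.7.14), frequency (3.7.13).
e_E = (-eps2 * k2, eps1 * k1, 0.0)
w2_E = (k1 ** 2 * eps1 + k2 ** 2 * eps2) / (eps1 * eps2)
res_E = residual(k, e_E, eps, w2_E)

# Energy flow direction e x (k x e) = e^2 k - (e.k) e for the extraordinary mode.
e_sq = sum(x * x for x in e_E)
e_dot_k = sum(x * y for x, y in zip(e_E, k))
P = [e_sq * ki - e_dot_k * ei for ki, ei in zip(k, e_E)]
# 2D cross product of P with (eps1 k1, eps2 k2); zero if the two are parallel.
parallel_check = P[0] * eps2 * k2 - P[1] * eps1 * k1
print(res_O, res_E, parallel_check)
```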


3.8. HELICITY AND ANGULAR MOMENTUM 


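We have seen that a polarization vector

$$\hat{\mathbf{e}}_+ = \frac{\hat{\mathbf{e}}_x + i\hat{\mathbf{e}}_y}{\sqrt{2}} \tag{3.8.1}$$

represents a right-handed circularly polarized wave advancing in the $z$
direction.

$$\hat{\mathbf{e}}_- = \frac{\hat{\mathbf{e}}_x - i\hat{\mathbf{e}}_y}{\sqrt{2}} \tag{3.8.2}$$

is its left-handed counterpart.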

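The vectors (3.8.1) and (3.8.2) transform particularly simply under
rotations. We consider a primed coordinate system that is rotated
clockwise by an angle $\theta$ about the $z$-axis (where $x, y, z$ form a right-handed
coordinate system). This is shown in Figure 3.5.

Figure 3.5.

Evidently,

$$\hat{\mathbf{e}}_x' = \hat{\mathbf{e}}_x\cos\theta + \hat{\mathbf{e}}_y\sin\theta \tag{3.8.3}$$

$$\hat{\mathbf{e}}_y' = \hat{\mathbf{e}}_y\cos\theta - \hat{\mathbf{e}}_x\sin\theta \tag{3.8.4}$$

so that

$$\hat{\mathbf{e}}_x' \pm i\hat{\mathbf{e}}_y' = (\hat{\mathbf{e}}_x\cos\theta + \hat{\mathbf{e}}_y\sin\theta) \pm i(\hat{\mathbf{e}}_y\cos\theta - \hat{\mathbf{e}}_x\sin\theta) = (\hat{\mathbf{e}}_x \pm i\hat{\mathbf{e}}_y)\cos\theta \mp i\sin\theta\,(\hat{\mathbf{e}}_x \pm i\hat{\mathbf{e}}_y) \tag{3.8.5}$$

or

$$\hat{\mathbf{e}}_\pm' = e^{\mp i\theta}\,\hat{\mathbf{e}}_\pm. \tag{3.8.6}$$

Vectors with the transformation property (3.8.6) are said to have helicity
$\pm 1$. There is a corresponding property of the components. Thus, with

$$\mathbf{e} = \hat{\mathbf{e}}_x e_x + \hat{\mathbf{e}}_y e_y = \hat{\mathbf{e}}_x' e_x' + \hat{\mathbf{e}}_y' e_y' \tag{3.8.7}$$

we have

$$e_x' = \hat{\mathbf{e}}_x'\cdot\hat{\mathbf{e}}_x\,e_x + \hat{\mathbf{e}}_x'\cdot\hat{\mathbf{e}}_y\,e_y \quad\text{and}\quad e_y' = \hat{\mathbf{e}}_y'\cdot\hat{\mathbf{e}}_x\,e_x + \hat{\mathbf{e}}_y'\cdot\hat{\mathbf{e}}_y\,e_y \tag{3.8.8}$$

or, from (3.8.3) and (3.8.4),

$$e_x' = \cos\theta\,e_x + \sin\theta\,e_y \tag{3.8.9}$$

$$e_y' = \cos\theta\,e_y - \sin\theta\,e_x \tag{3.8.10}$$

and from

$$\mathbf{e} = e_+\hat{\mathbf{e}}_+ + e_-\hat{\mathbf{e}}_- = e_+'\hat{\mathbf{e}}_+' + e_-'\hat{\mathbf{e}}_-' \tag{3.8.11}$$

and (3.8.6), we find

$$e_\pm' = e^{\pm i\theta}\,e_\pm. \tag{3.8.12}$$

Note the opposite definition of $e_\pm$ from that of $\hat{\mathbf{e}}_\pm$.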
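
The phase acquired by the circular polarization vectors under the rotation can be confirmed numerically. This is a minimal sketch that assumes the primed basis $\hat{\mathbf{e}}_x' = \hat{\mathbf{e}}_x\cos\theta + \hat{\mathbf{e}}_y\sin\theta$, $\hat{\mathbf{e}}_y' = \hat{\mathbf{e}}_y\cos\theta - \hat{\mathbf{e}}_x\sin\theta$, which is the form consistent with the component relations (3.8.9) and (3.8.10); the function name `rotate_basis` and the angle used are illustrative.

```python
import cmath
import math

def rotate_basis(theta):
    """Primed unit vectors, written as (x, y) components in the unprimed frame."""
    ex_p = (math.cos(theta), math.sin(theta))
    ey_p = (-math.sin(theta), math.cos(theta))
    return ex_p, ey_p

theta = 0.7                                   # illustrative angle
ex_p, ey_p = rotate_basis(theta)
s = 1.0 / math.sqrt(2.0)

# Circular basis vectors (e_x +/- i e_y)/sqrt(2) in the unprimed frame.
e_plus = (s, 1j * s)
e_minus = (s, -1j * s)

# The same combinations built from the primed unit vectors.
e_plus_p = tuple(s * (a + 1j * b) for a, b in zip(ex_p, ey_p))
e_minus_p = tuple(s * (a - 1j * b) for a, b in zip(ex_p, ey_p))

# Expected relation: e'_(+/-) = exp(-/+ i theta) e_(+/-).
err_plus = max(abs(p - cmath.exp(-1j * theta) * q) for p, q in zip(e_plus_p, e_plus))
err_minus = max(abs(p - cmath.exp(+1j * theta) * q) for p, q in zip(e_minus_p, e_minus))
print(err_plus, err_minus)
```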

There is an intimate connection between the helicity and the angular
momentum of a plane wave of wave number $k$ and frequency $\omega$,
propagating in empty space in the $z$ direction. The relation, as we shall now show,
is

$$\lambda = \frac{\omega L_z}{W} \tag{3.8.13}$$

where $\lambda$ is the helicity of the wave, $L_z$ the $z$-component of the
electromagnetic angular momentum, and $W$ the electromagnetic energy. For a single
photon, with $W = \hbar\omega$ and $\lambda = \pm 1$, we would find, from (3.8.13),

$$L_z = \pm\hbar, \tag{3.8.14}$$

suggesting that the intrinsic spin angular momentum of a photon is $\pm 1$.

We calculate the angular momentum of an electromagnetic
disturbance propagating with approximate wave number $\mathbf{k}_0$, approximate
frequency $\omega_0$, and approximate polarization $\hat{\mathbf{e}}_x + i\hat{\mathbf{e}}_y$. These parameters must
all be approximate, since for a wave of definite wave number both the
energy and angular momentum would be infinite; therefore, the wave
must be bounded in all three directions; the polarization must be modified
as well to maintain transversality.

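We review first the angular momentum integral for charges and
currents in free space. We have for the force density on charges

$$\mathbf{f} = \rho\mathbf{E} + \frac{1}{c}\,\mathbf{j}\times\mathbf{B}, \tag{3.8.15}$$

and from (3.3.6),

$$f_i = -\frac{1}{4\pi c}\frac{\partial}{\partial t}(\mathbf{E}\times\mathbf{B})_i + \partial_k T_{ik} \tag{3.8.16}$$

(where $T_{ik}$ is the Maxwell stress tensor). From (3.3.6), we derived the
expression

$$\mathbf{p}_{\rm e.m.} = \int_V d\mathbf{r}\,\frac{\mathbf{E}\times\mathbf{B}}{4\pi c} \tag{3.3.7}$$

for the electromagnetic momentum $\mathbf{p}_{\rm e.m.}$.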
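Similarly, the torque density is

$$\boldsymbol{\tau} = \mathbf{r}\times\mathbf{f} \quad\text{or}\quad \tau_i = \epsilon_{ijk}\,x_j f_k \tag{3.8.17}$$

so that, with $\mathbf{L}_m$ the angular momentum of matter, we have

$$\frac{dL_{m,i}}{dt} = -\frac{1}{4\pi c}\frac{d}{dt}\int d\mathbf{r}\,[\mathbf{r}\times(\mathbf{E}\times\mathbf{B})]_i + \epsilon_{ijk}\int d\mathbf{r}\,x_j\,\partial_l T_{kl} \tag{3.8.18}$$

or, with

$$\mathbf{L}_{\rm e.m.} = \frac{1}{4\pi c}\int \mathbf{r}\times(\mathbf{E}\times\mathbf{B})\,d\mathbf{r}, \tag{3.8.19}$$

$$\left(\frac{d\mathbf{L}_m}{dt} + \frac{d\mathbf{L}_{\rm e.m.}}{dt}\right)_i = \epsilon_{ijk}\int_V d\mathbf{r}\,x_j\,\partial_l T_{kl}. \tag{3.8.20}$$

Since $\epsilon_{ijk}T_{jk} = 0$ ($\epsilon$ is antisymmetric, $T$ symmetric), we have from (3.8.20)
an angular momentum flux density

$$\mathcal{B}_{il} = \epsilon_{ijk}\,x_j\,T_{kl}, \tag{3.8.21}$$

and for fields that go to zero sufficiently rapidly, a field angular momentum

$$\mathbf{L}_{\rm e.m.} = \int \frac{d\mathbf{r}}{4\pi c}\;\mathbf{r}\times(\mathbf{E}\times\mathbf{B}). \tag{3.8.22}$$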


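Our next task is to evaluate the integral for $\mathbf{L}_{\rm e.m.}$ for an approximately
monochromatic plane wave. We call $\omega_0$ and $\mathbf{k}_0$ the approximate values of
the frequency and wave vector.

We start with the general expression (3.8.22). We express the fields
in terms of their Fourier transforms, automatically satisfying the free field
equations. Thus, with a convenient normalization,

$$\mathbf{E} = \int \frac{d\mathbf{k}}{2\pi}\;\mathbf{a}(\mathbf{k})\,\omega\,e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)} + \text{c.c.}, \tag{3.8.23}$$

and

$$\mathbf{B} = \int \frac{d\mathbf{k}}{2\pi}\;c\,\mathbf{k}\times\mathbf{a}(\mathbf{k})\,e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)} + \text{c.c.} \tag{3.8.24}$$

with $\omega = c|\mathbf{k}|$ and $\mathbf{k}\cdot\mathbf{a} = 0$.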
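The energy in the electromagnetic field is

$$W = \int d\mathbf{r}\,\frac{E^2 + B^2}{8\pi} = \int d\mathbf{k}\,|\mathbf{a}(\mathbf{k})|^2\,\omega^2; \tag{3.8.25}$$

the momentum is

$$\mathbf{P} = \int d\mathbf{k}\,|\mathbf{a}(\mathbf{k})|^2\,\mathbf{k}\,\omega; \tag{3.8.26}$$

they have the approximate ratio

$$\frac{\mathbf{P}}{W} \cong \frac{\mathbf{k}_0}{\omega_0}. \tag{3.8.27}$$

The terms in the integrals for these conserved quantities that come from
products of $\mathbf{a}$ with $\mathbf{a}$ or $\mathbf{a}^*$ with $\mathbf{a}^*$ must be zero, since otherwise their
time dependence would violate the conservation law.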

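The angular momentum is a little harder. With $\mathbf{L}$ given by (3.8.19)
and $\mathbf{E}$ and $\mathbf{B}$ by (3.8.23) and (3.8.24), we have

$$\mathbf{L} = \frac{1}{4\pi c}\int d\mathbf{r}\;\mathbf{r}\times\left[\int \frac{d\mathbf{k}'}{2\pi}\,\mathbf{a}^*(\mathbf{k}')\,\omega'\,e^{-i(\mathbf{k}'\cdot\mathbf{r} - \omega' t)} \times \int \frac{d\mathbf{k}}{2\pi}\,c\,\mathbf{k}\times\mathbf{a}(\mathbf{k})\,e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}\right] + \text{c.c.} \tag{3.8.28}$$

Here again, we have dropped products of $\mathbf{a}$ with $\mathbf{a}$ and $\mathbf{a}^*$ with $\mathbf{a}^*$.
To carry out the integral in (3.8.28), we replace $\mathbf{r}\,e^{-i\mathbf{k}'\cdot\mathbf{r}}$ by
$i\nabla_{\mathbf{k}'}\,e^{-i\mathbf{k}'\cdot\mathbf{r}}$, do the $\mathbf{r}$ integral, and find

$$\mathbf{L} = \mathrm{Re}\int d\mathbf{k}\,d\mathbf{k}'\;i\nabla_{\mathbf{k}'}\,\delta(\mathbf{k} - \mathbf{k}')\times\left[\mathbf{a}^*(\mathbf{k}')\,\omega'\times(\mathbf{k}\times\mathbf{a}(\mathbf{k}))\right]e^{i(\omega' - \omega)t}. \tag{3.8.29}$$

We integrate by parts, carry out the indicated derivatives, and then
set $\mathbf{k} = \mathbf{k}'$, leaving

$$\mathbf{L} = \int d\mathbf{k}\left\{\omega\,a_i^*(\mathbf{k})\,\mathbf{k}\times\frac{\nabla_{\mathbf{k}}}{i}\,a_i(\mathbf{k}) + \frac{\omega}{i}\,\mathbf{a}^*\times\mathbf{a}\right\}. \tag{3.8.30}$$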


Equation (3.8.30) decomposes the angular momentum into two terms.
The first depends in detail on the structure and location of the wave
packet; the second does not. We study a wave packet chosen to maintain
the monochromatic and planar nature of the field to the maximum extent
possible, consistent with the requirement that the total energy in the field
be finite. Thus, we let

$$\mathbf{a}\sqrt{\omega} = \left[(\hat{\mathbf{e}}_x + i\hat{\mathbf{e}}_y)\,k_z - (k_x + ik_y)\,\hat{\mathbf{e}}_z\right]e^{-i\mathbf{k}\cdot\mathbf{r}_0}\,b(\mathbf{k}); \tag{3.8.31}$$

$b(\mathbf{k})$ is a real function of $\mathbf{k} - \mathbf{k}_0$, with widths $\Delta k_x$, $\Delta k_y$, and $\Delta k_z$ much
smaller than $k_0$. The vectorial coefficient in (3.8.31) is chosen to ensure
the transversality of $\mathbf{a}$. The wave vector $\mathbf{k}_0$ is in the $z$ direction.

Evidently, the explicit phase $-i\mathbf{k}\cdot\mathbf{r}_0$ displaces the wave packet by a
vector $\mathbf{r}_0$ and adds an orbital angular momentum $\mathbf{L}_{\rm orb} = \mathbf{r}_0\times\mathbf{P}$ to the
angular momentum. We can now easily calculate the energy, momentum,
and angular momentum carried by the wave function (3.8.31), neglecting
the spread in wave numbers $\Delta k$ compared with $k_0$. We find

$$W = 2ck_0^3\int d\mathbf{k}\,|b|^2, \tag{3.8.32}$$

$$\mathbf{P} = 2k_0^2\,\mathbf{k}_0\int d\mathbf{k}\,|b|^2, \tag{3.8.33}$$

and

$$\mathbf{L} = \mathbf{r}_0\times\mathbf{P} + 2k_0\,\mathbf{k}_0\int d\mathbf{k}\,|b|^2; \tag{3.8.34}$$

corrections are of order $(\Delta k)^2/k_0^2$. The second term in (3.8.34) corresponds
in quantum theory to a spin angular momentum. Our result states that
the wave with helicity $\pm 1$ carries an angular momentum

$$L_z = \pm\frac{W}{\omega}. \tag{3.8.35}$$

Note that setting $W = \hbar\omega$ leads to $L_z = \pm\hbar$.


CHAPTER 3 PROBLEMS 


3.1 At $t = 0$, a charge density $\rho_0(\mathbf{r})$ is found to exist in a medium with
constant real dielectric constant $\epsilon$ and conductivity $\sigma$. Assume that
$\int d\mathbf{r}\,\rho_0(\mathbf{r}) = Q_0$ is finite.
(a) Show that the charge density decays according to

$$\rho(\mathbf{r}, t) = \rho_0(\mathbf{r})\,e^{-(4\pi\sigma/\epsilon)t}.$$

(b) Where does the charge go? Show explicitly that the outgoing
current correctly accounts for the disappearance of the charge
for the special case of a spherically symmetric $\rho_0$.

3.2 One could try to construct a model of a finite size charged particle
by postulating a charge density $\rho(\mathbf{r}, t) = q f(\mathbf{r} - \mathbf{y}(t))$, where $\mathbf{y}(t)$ is
the trajectory of a fixed point in the charge distribution and $\mathbf{r}$ the
point in space at which one is specifying the charge density. If $q$ is
the total charge, we must have

$$\int f(\mathbf{r} - \mathbf{y}(t))\,d\mathbf{r} = 1.$$

A point charge would then have $f(\mathbf{x}) = \delta^3(\mathbf{x})$.
We have seen in Problem 2.1 that a suitable current density to
go with $\rho$ above would be

$$\mathbf{j}(\mathbf{r}, t) = q\,\frac{d\mathbf{y}(t)}{dt}\,f(\mathbf{r} - \mathbf{y}(t)).$$

The limit of a point charge is singular, since the electric field at
the position of a point particle is infinite. This singularity does not
occur for a finite size charged particle, such as suggested above. The
field equations would be, as usual,

$$\nabla\times\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t}$$

$$\nabla\cdot\mathbf{E} = 4\pi\rho$$

$$\nabla\times\mathbf{B} = \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} + \frac{4\pi}{c}\,\mathbf{j}$$

and $\nabla\cdot\mathbf{B} = 0$, with $\rho$ and $\mathbf{j}$ as given above. The particle motion
(nonrelativistic) would be given, following Newton and Lorentz, by
the equation

$$m\ddot{\mathbf{y}} = q\int d\mathbf{r}\,f(\mathbf{r} - \mathbf{y})\left[\mathbf{E}(\mathbf{r}, t) + \frac{1}{c}\frac{d\mathbf{y}}{dt}\times\mathbf{B}(\mathbf{r}, t)\right].$$

(a) Suppose there are $n$ charges, each with charge $q_i$, mass $m_i$, and
coordinate $\mathbf{y}_i$. Give the appropriate generalization of the Maxwell
and Newton-Lorentz equations.

(b) Show that there is a conserved energy in a volume $V$, consisting
of the sum of the electromagnetic energy and the kinetic energy
of the charged particles contained in $V$ [as in (1.3.11)], provided
that the surface integral of the Poynting flux integrated over the
bounding surface of the volume vanishes, and that no particle is
close enough to the boundary for $\rho(\mathbf{r}, t)$ to be different from zero
on it. For this purpose, assume that $f(\mathbf{x})$ vanishes for $|\mathbf{x}| > b$, a
particle radius.
We can extend the volume to include all space, provided the
energy integral converges and the integrated Poynting flux goes
to zero as $r \to \infty$.

(c) Show that there is a conserved momentum in a volume $V$, again
consisting of the sum of the electromagnetic momentum (3.3.7)
and the momentum of the charged particles contained in $V$.
Formulate the conditions under which this holds.

*(d) Let $f(\mathbf{x})$ be spherically symmetric, that is, $f(\mathbf{x}) = f(|\mathbf{x}|)$. One
might expect in this case that there would be a similar
conservation law for angular momentum. Try it. Show that the usual
definition works only for a point particle, that is, for $f(\mathbf{x}) = \delta^3(\mathbf{x})$.
There is, however, a way of constructing a conserved
angular momentum for this case, which we will take up when
we come to Lagrangians (Chapter 7).


3.3 Imagine a wave packet incident on a plane dielectric boundary. We
study the energy balance in the process for real $\epsilon$. Since $\mathbf{E}_t$ and $\mathbf{H}_t$
are continuous, so is $(\mathbf{E}\times\mathbf{H})_n$. Here, $t$ stands for tangential and $n$
for normal. More interesting is the time and area integral of the
Poynting flux, over an appropriate closed surface:

$$I = \int_{t_1}^{t_2} dt \oint_S \mathbf{P}\cdot d\mathbf{S}$$

which equals

$$I = W_V(t_2) - W_V(t_1)$$

where $W_V(t)$ is the electromagnetic energy contained in the enclosed
volume $V$ at time $t$. If $t_2$ and $t_1$ are, respectively, after the time of
departure and before the time of arrival of the wave packet at the
surface $S$, then $I = 0$.

Let $W$ be the incident energy, $W_R$ the reflected energy, and $W_T$
the transmitted energy. Show (using the example worked out in the
text) that the condition $I = 0$ is equivalent to $W = W_R + W_T$.


3.4 Calculate the dipole moment of a classical atom in a constant
magnetic field $\mathbf{B}_0$. Let the electric field be given by $\mathbf{E} = \mathbf{E}_0\,e^{-i\omega t}$, with $\mathbf{E}_0$
constant in space and the atom modeled by an electron bound as an
isotropic harmonic oscillator to its center. Let $\omega_0$ be the resonant
frequency of the oscillator and $m$ the mass of the electron.

(a) Write the Newton-Lorentz equation of motion for the electron
(charge $-e$) in the presence of the fields $\mathbf{E}$ and $\mathbf{B}_0$.

(b) Look for a solution $\mathbf{r} = \mathbf{r}_0\,e^{-i\omega t}$ and find the equation satisfied
by $\mathbf{r}_0$.

(c) Solve for $\mathbf{r}_0$ by expanding in powers of $B_0$, keeping terms linear
in $B_0$.
(d) Show that a gas of such atoms, $n$ per unit volume, would have
a tensor dielectric constant

$$\epsilon_{ij} = \epsilon_0\,\delta_{ij} + i\epsilon_1\,\epsilon_{ijk}B_{0k}$$

where the coefficient $\epsilon_1$, the Faraday coefficient, is given by

$$B_0\,\epsilon_1 = \omega_L\,\frac{\partial\epsilon_0}{\partial\omega}$$

with $\omega_L = eB_0/2mc$, the Larmor frequency.

(e) Considering only the zeroth-order term $\epsilon_0$, give an expression
for the dielectric constant $\epsilon_0(\omega)$; from it calculate the absorption
of energy by the dielectric from an applied field
$\mathbf{E}(t) = \int d\omega\,\mathbf{f}(\omega)\,e^{-i\omega t}$. Remember that causality requires that the
integral over $\omega$ in the neighborhood of a singularity of $\epsilon_0$ be
calculated by circling into the upper half plane.


3.5 Rotation of light in a magnetic field: Consider a beam of light moving
through a medium whose dielectric constant is

$$\epsilon_{ij} = \epsilon_0\,\delta_{ij} + i\epsilon_1\,\epsilon_{ij3}.$$

$\epsilon_0$ and $\epsilon_1$ are real numbers.

(a) Show that right and left circularly polarized beams propagate
in the 3 direction with no absorption, but different dielectric
constants.

(b) Calculate the angle of rotation per unit length of a plane
polarized beam propagating in the $z$ direction.


3.6 A monochromatic plane polarized electromagnetic wave in vacuum
is normally incident on a flat surface bounding a medium of real
permeability $\mu$, real dielectric constant $\epsilon$, and conductivity $\sigma$. The
circular frequency of the wave is $\omega$. Assume the wave incident
from the left. Then the incident wave is

$$\mathbf{E}_i = \mathbf{e}_i\,e^{i(kx - \omega t)}$$

and

$$\mathbf{B}_i = \hat{\mathbf{e}}_x\times\mathbf{e}_i\,e^{i(kx - \omega t)}$$

where $k = \omega/c$, the fixed vector $\mathbf{e}_i$ is in the $y$, $z$-plane, and it is
understood that the real part is to be taken. The reflected wave is

$$\mathbf{E}_r = R\,\mathbf{e}_i\,e^{i(-kx - \omega t)}$$

($R$ is the reflection coefficient) and

$$\mathbf{B}_r = -\hat{\mathbf{e}}_x\times\mathbf{e}_i\,R\,e^{i(-kx - \omega t)}.$$

The transmitted wave is

$$\mathbf{E}_t = \mathbf{e}_i\,T\,e^{i(k'x - \omega t)}$$

and

$$\mathbf{B}_t = \frac{k'}{k}\,\hat{\mathbf{e}}_x\times\mathbf{e}_i\,T\,e^{i(k'x - \omega t)}.$$

$T$ is the transmission coefficient.

(a) Calculate the real and imaginary parts of $k'$ for all positive values
of $\mu$, $\epsilon$, and $\sigma$.

(b) Calculate the reflection coefficient $R$ and transmission coefficient
$T$ for small values of $\omega/\sigma$.

(c) In the same approximation, calculate the flux of energy into the
medium. What happens to it?

(d) Still in the limiting case of small $\omega/\sigma$, verify that the discontinuity
in $H_t$ from outside the conductor to well within the conductor
(where the field is zero) is correctly given by

$$\Delta H_t = \frac{4\pi}{c}\,K = \frac{4\pi}{c}\int_0^\infty dx\,j(x)$$

where $K$ is the surface current and $j$ the current density parallel
to the plane boundary.


3.7 Find the reflection and transmission coefficients corresponding to
(3.6.23) and (3.6.24) for the case of polarization normal to the plane
of incidence.


3.8 (a) Show that the boundary condition $\mathbf{E}_{\rm tangential}$ continuous between
two media guarantees the continuity of $\partial B_{\rm normal}/\partial t$, provided $\mathbf{E}$
and $\mathbf{B}$ satisfy Maxwell's equations in both media.

(b) Consider next the boundary condition for the inhomogeneous
equations. For media of finite conductivity (i.e., finite $\mathbf{j}$), show
that the continuity of $\mathbf{H}_{\rm tangential}$ guarantees that
$(\partial/\partial t)\,\Delta D_{\rm normal} = 4\pi\,\partial\sigma/\partial t$ ($\sigma$ is surface charge density), provided Maxwell's
equations hold in both media.


3.9 Consider cylindrical wave guides in the TEM mode.

(a) Show that a cylindrical wave guide consisting of the space inside
a single perfectly conducting cylindrical shell cannot support a
transverse electromagnetic wave, that is, a wave of the form

$$\mathbf{E}(\mathbf{r}, t) = \mathbf{E}_0\,e^{i(kz - \omega t)}$$

and

$$\mathbf{B}(\mathbf{r}, t) = \mathbf{B}_0\,e^{i(kz - \omega t)}$$

where $z$ is the direction of the cylindrical axis, and the vectors
$\mathbf{E}_0$ and $\mathbf{B}_0$ are functions of $x$ and $y$ and lie in the $x$, $y$-plane.

(b) Show that a wave guide consisting of the space between two
perfectly conducting cylinders can support such a wave, the
functions $\mathbf{E}_0$ and $\mathbf{B}_0$ are unique (up to a scale) for a given shape
of the confining cylinders, and the wave propagates with light
velocity. The wave is called a TEM (transverse electric and
magnetic) mode.

(c) Give these functions $\mathbf{E}$ and $\mathbf{B}$ for the special case of two coaxial
circular cylinders.

(d) From your result in (c) above, calculate, in terms of your
parameterization, the energy per unit time flowing down the wave
guide.

*(e) In the high conductivity ($\sigma/\omega \gg 1$) and small skin depth limit
($\mathrm{Im}\,k \gg$ curvature of the cylinders, where $k$ is the wave number
inside the conducting medium), calculate the energy loss per unit
time and length by Poynting flow into the conductors. Treat the
conducting surface as planar and calculate the longitudinal
electric field by continuity: $\mathbf{H}_{\rm tangential}$ is continuous; so $\mathbf{H}_{\rm tangential}$
outside the conductor $= \mathbf{H}_{\rm tangential}$ inside the conductor; then
calculate $E_z$ inside the conductor from Maxwell's equations and
$E_z$ just outside the conductor by continuity. From $\mathbf{H}_{\rm tangential}$ and
$E_z$, calculate the Poynting flow into the conductors.


3.10 Consider a cylindrical wave guide in the TE and TM modes.

We consider here a medium of constant permeability $\mu$ and
dielectric constant $\epsilon$, bounded by a perfect conductor of
unspecified cross-sectional shape. It is obviously consistent to
separate the fields into longitudinal ($z$ component) and transverse
($x$, $y$ components). Thus, we write

$$\mathbf{E} = [\mathbf{E}_t(x, y) + \hat{\mathbf{e}}_z E_z(x, y)]\,f(z, t)$$

and

$$\mathbf{B} = [\mathbf{B}_t(x, y) + \hat{\mathbf{e}}_z B_z(x, y)]\,g(z, t).$$

Similarly, we write $\nabla = \hat{\mathbf{e}}_z\,(\partial/\partial z) + \nabla_t$. We take advantage of the
absence of $z$ or $t$ dependence in Maxwell's equations to let
$f(z, t) = g(z, t) = e^{i(kz - \omega t)}$, so that $\partial/\partial z$ becomes $ik$ and $\partial/\partial t$ becomes $-i\omega$.

(a) Show that, by eliminating $\mathbf{H}_t$ from one transverse equation and
substituting in the other (and vice versa for $\mathbf{E}_t$), one can solve
for $\mathbf{E}_t$ and $\mathbf{B}_t$ as functions of $E_z$, $B_z$, $\omega$, $k$, $\epsilon$, and $\mu$, and that
$E_z$ and $B_z$ each satisfy the equation

$$\left(\nabla_t^2 + \frac{\epsilon\mu\,\omega^2}{c^2} - k^2\right)\begin{pmatrix} E_z \\ B_z \end{pmatrix} = 0.$$

(b) Determine the boundary conditions on $E_z$ and $B_z$ at the
perfectly conducting boundary.

(c) Show that these boundary conditions allow solutions with $E_z = 0$
(called TE, or transverse electric) and with $H_z = 0$ (called
TM, or transverse magnetic).

*(d) Now derive the correct boundary conditions on $E_z$ and $H_z$ when
both the inner and outer media have finite $\mu$ and $\epsilon$, and show
that the separability into TM and TE modes no longer holds in
general.

*(e) Show that a circular geometry still permits TM and TE modes
for $E_z$ and $H_z$ independent of azimuthal angle ($m = 0$ mode).


*3.11 A monoaxial crystal with $\mu = 1$ and real, symmetric $\epsilon_{ij}$ has a plane
boundary perpendicular to the optic axis. The characteristic values
of $\epsilon_{ij}$ are $\epsilon_1$ and $\epsilon_2$.

(a) Show that the ordinary ray refracts according to Snell's law.

(b) Find the generalization of Snell's law for the extraordinary ray.
That is, express $\sin\theta'$ as a function of $\epsilon_1$, $\epsilon_2$, and $\sin\theta$.
Remember that the direction of propagation $\theta'$ is given by $\nabla_{\mathbf{k}}\,\omega$.


3.12 Work out in detail the results (3.8.32), (3.8.33), and (3.8.34)
starting from (3.8.25), (3.8.26), (3.8.30), and (3.8.31).


*3.13 Use Cauchy's theorem and more to prove the following statements
about $\epsilon(\omega)$. Given that $\epsilon(\omega)$ is analytic in the upper half $\omega$ plane,
$\epsilon(\omega) \to 1$ as $\omega \to \infty$, $\epsilon(\omega) = \epsilon^*(-\omega)$ on the real axis, and
$\mathrm{Im}\,\epsilon(\omega) > 0$ for $\omega$ on the real axis and positive, show:

(a) $\mathrm{Im}\,\epsilon(\omega) < 0$ for $\omega$ on the real axis and negative.

(b) $\epsilon(\omega)$ has no zeros in the upper half $\omega$ plane, so that $\sqrt{\epsilon(\omega)}$ is
analytic in the upper half $\omega$ plane.

(c) Choose the square root of $\epsilon$ so that it has a positive imaginary
part for $\omega > 0$ on the real axis. Then show that $\sqrt{\epsilon}$ has a negative
imaginary part for $\omega < 0$ on the real axis.

(d) $\mathrm{Im}\,\sqrt{\epsilon}/\mathrm{Re}\,\sqrt{\epsilon} > 0$ for $\omega > 0$ on the real axis, so that $\sqrt{\epsilon}$ also
approaches 1 as $\omega \to \infty$.


3.14 Show that the $E$ field defined in (3.5.30) and (3.5.31) propagates
causally.


A plasma is a neutral ionized gas. A simple model for a plasma is 
a dilute gas of electrons in a uniform positive background (that 
ensures overall neutrality). Call n the number of electrons per unit 
volume. It is convenient to define a characteristic plasma frequency
$\omega_p$ by the equation $\omega_p^2 = 4\pi n e^2/m$, where $e$ is the electron charge
and $m$ its mass.


(a) With this model, neglecting interparticle and magnetic interactions
and the space dependence of the wave field, find the
reaction of the electron gas to a passing electromagnetic wave
of circular frequency $\omega$ by solving for the steady-state motion
of an isolated electron in the electric field of the wave.


(b) Now find the dielectric constant of the dilute gas by calculating 
the steady-state polarization per unit volume P in the wave field 
E and using the formula 


$$ \mathbf{D} = \epsilon\mathbf{E} = \mathbf{E} + 4\pi\mathbf{P}. $$


(c) Find the conditions (in terms of $e$, $m$, $n$, $\omega$, and $E$) for the
approximations suggested in (a) above to be valid and compatible
with each other.
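
As a numerical aside, the definition $\omega_p^2 = 4\pi n e^2/m$ (Gaussian units) can be evaluated for a sample density to get a feeling for the scale involved. The sketch below is illustrative only: the function name `plasma_frequency` and the sample density are assumptions, not part of the problem, and it does not carry out any part of the requested derivation.

```python
import math

# Gaussian (cgs) constants, quoted to limited precision.
E_CHARGE_ESU = 4.803e-10   # electron charge in statcoulombs
E_MASS_G = 9.109e-28       # electron mass in grams


def plasma_frequency(n_per_cm3: float) -> float:
    """Return omega_p in rad/s from omega_p**2 = 4*pi*n*e**2/m."""
    return math.sqrt(4.0 * math.pi * n_per_cm3 * E_CHARGE_ESU ** 2 / E_MASS_G)


if __name__ == "__main__":
    n = 1.0e12  # hypothetical electron density in cm^-3
    print(f"omega_p = {plasma_frequency(n):.3e} rad/s")
```

Because $\omega_p$ scales as $\sqrt{n}$, quadrupling the density doubles the plasma frequency.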


The systematic way to deal with a plasma (or for that matter any
low-density gas) is to introduce the Boltzmann function $f(\mathbf{v}, \mathbf{x})$,
where $\Delta n(\mathbf{v}, \mathbf{x}) = f(\mathbf{v}, \mathbf{x})\,\Delta\mathbf{v}\,\Delta\mathbf{x}$ is the number of gas particles
(electrons in a plasma, with the positive ions approximated as a positive
background) in the six-dimensional volume element $\Delta\mathbf{v}\,\Delta\mathbf{x}$.


(a) By considering the number of particles entering the volume
element $\Delta\mathbf{v}\,\Delta\mathbf{x}$ through the surface $\Delta y\,\Delta z\,\Delta\mathbf{v}$ at $x$ and leaving
through the surface at $x + \Delta x$, and similarly the number entering
through the surface $\Delta v_y\,\Delta v_z\,\Delta\mathbf{x}$ at $v_x$ and leaving through the
surface at $v_x + \Delta v_x$, and continuing to the other eight surfaces,
derive the Boltzmann equation:

$$ \frac{\partial f}{\partial t} + \mathbf{v}\cdot\nabla f + \nabla_{\mathbf{v}}\cdot(\mathbf{a}f) = \left.\frac{\partial f}{\partial t}\right|_{\rm coll}, $$

where $\mathbf{a}(\mathbf{x}, \mathbf{v})$ is the acceleration of a particle at $\mathbf{x}$ with velocity
$\mathbf{v}$ and $\partial f/\partial t|_{\rm coll}\,\Delta\mathbf{x}\,\Delta\mathbf{v}$ is the number of particles thrown into the
volume element $\Delta\mathbf{x}\,\Delta\mathbf{v}$ through “collisions.” The distinction
between $-\nabla_{\mathbf{v}}\cdot\mathbf{a}f$ and $\partial f/\partial t|_{\rm coll}$ is not absolute. We take $\mathbf{a}$ to
correspond to the average field acting on the particles; $\partial f/\partial t|_{\rm coll}$
describes the rest, whatever there may be.

In a rare, approximately collisionless plasma,
$\mathbf{a} = (e/m)(\mathbf{E} + \mathbf{v}/c \times \mathbf{B})$, and $\partial f/\partial t|_{\rm coll}$ is set equal to zero; this neglect
must be justified a posteriori following the calculation of the
collisionless motion.


(b) Recover the result of Problem 3.15 by solving for the linear
response of the Boltzmann function to a weak applied field
$\mathbf{E}(\omega)$. Let $f_0$ be the Boltzmann function in the absence of the
applied field and $f_0 + f_1$ the Boltzmann function in the presence
of the field. Then, if we keep only linear terms and neglect
collisions, the Boltzmann equation becomes

$$ \frac{\partial f_1}{\partial t} + \mathbf{v}\cdot\nabla f_1 + \frac{e}{m}\,\mathbf{E}\cdot\nabla_{\mathbf{v}} f_0 = 0. \tag{1} $$

Consider now long wavelengths (i.e., $vk \ll \omega$), so that the
$\mathbf{v}\cdot\nabla f_1$ term can be dropped. In this approximation, solve Eq.
(1) for $f_1$ and from the solution calculate the induced current
$\mathbf{j} = e\int d\mathbf{v}\,\mathbf{v} f_1$. Remember that $\int d\mathbf{v}\, f_0(\mathbf{v}, \mathbf{x}) = n_0(\mathbf{x})$, the local
density, and that $\mathbf{j} = \partial\mathbf{P}/\partial t$, where $\mathbf{P}$ is the polarization per unit
volume.

From this, calculate the dielectric constant and recover the
result of the equivalent calculation carried out in Problem 3.15.


(c) In order to include long-range electromagnetic interactions, one
must include the plasma as the source of electromagnetic fields
(the equations so arrived at are called the Boltzmann-Vlasov
equations):

$$ \frac{\partial f}{\partial t} + \mathbf{v}\cdot\nabla f + \nabla_{\mathbf{v}}\cdot(\mathbf{a}f) = 0, $$

$$ \nabla\times\mathbf{B} = \frac{4\pi}{c}\,\mathbf{j} + \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t}, $$

and

$$ \nabla\times\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t}. $$


Find the normal modes of these equations, for a given wave
number $k$, with the same approximation as in part (b) above,
that is, $vk/\omega$, but not $ck/\omega$, negligible and $\omega$ of the same order
of magnitude as $\omega_p$.

You should find propagating transverse modes with the
dispersion law

$$ \omega^2 = \omega_p^2 + c^2k^2 $$


or, with


just as in part (b) above.

In addition, there is a discrete longitudinal mode, approximately
independent of $vk$: $\omega = \omega_p$.

A careful treatment taking the small $vk$ dependence into
account shows that for a normal velocity distribution function
$f_0$, this mode actually damps, transferring its energy to particle
motion.


3.17 (a) Formulate a set of Maxwell's equations that would take into
account the existence of a magnetic current and charge density
$\mathbf{j}_m$ and $\rho_m$, corresponding to the familiar electric quantities $\mathbf{j}_e$
and $\rho_e$. Clearly, $\mathbf{j}_m$ and $\rho_m$ would go on the right-hand side of
the homogeneous equations:


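$$ \nabla\cdot\mathbf{B} = 4\pi\alpha\rho_m, \qquad \nabla\times\mathbf{E} + \frac{1}{c}\frac{\partial\mathbf{B}}{\partial t} = \frac{4\pi\beta}{c}\,\mathbf{j}_m, $$


where $\alpha$ and $\beta$ are to be determined. The volume force law
is

$$ \mathbf{F} = \int \mathbf{f}\,d\mathbf{r}, $$

and

$$ \mathbf{f} = \rho_e\mathbf{E} + \frac{\mathbf{j}_e}{c}\times\mathbf{B} + \rho_m\mathbf{B} + \gamma\,\frac{\mathbf{j}_m}{c}\times\mathbf{E}, $$


where $\gamma$ is to be determined. The positive sign of $\rho_m\mathbf{B}$ in the
force equation is a definition. The coefficients are to be
determined by the requirement of internal consistency and the
existence of conservation laws for energy, momentum, and
angular momentum. Thus, you must find for the change of
field energy:

(i)
$$ \frac{\partial}{\partial t}\,\frac{B^2 + E^2}{8\pi} = -\nabla\cdot\frac{c\,\mathbf{E}\times\mathbf{B}}{4\pi} - (\mathbf{j}_e\cdot\mathbf{E} + \mathbf{j}_m\cdot\mathbf{B}), $$


and for the total force on matter inside the volume $V$:

(ii)
$$ \int_V f_i\,d\mathbf{r} = -\frac{d}{dt}\int_V \frac{(\mathbf{E}\times\mathbf{B})_i}{4\pi c}\,d\mathbf{r} + \int_V \frac{\partial T_{ij}}{\partial x_j}\,d\mathbf{r}, $$

where

$$ T_{ij} = \frac{E_iE_j + B_iB_j}{4\pi} - \frac{(B^2 + E^2)\,\delta_{ij}}{8\pi}, $$


and
(iii) a field angular momentum

$$ \mathbf{L} = \int d\mathbf{r}\,\frac{\mathbf{r}\times(\mathbf{E}\times\mathbf{B})}{4\pi c} $$


satisfying an equation analogous to (ii) above.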


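(b) Finally, calculate the angular momentum of a stationary magnetic
pole $p$ at $\mathbf{r}_1$ and electric charge $q$ at $\mathbf{r}_2$:

$$ \mathbf{L} = \frac{1}{4\pi c}\int d\mathbf{r}\;\mathbf{r}\times\left[\frac{q\,(\mathbf{r}-\mathbf{r}_2)}{|\mathbf{r}-\mathbf{r}_2|^3}\times\frac{p\,(\mathbf{r}-\mathbf{r}_1)}{|\mathbf{r}-\mathbf{r}_1|^3}\right]. $$


(i) Show first that $\mathbf{L}$ is independent of the origin of $\mathbf{r}$.
(ii) Now choose $\mathbf{r}_2$ as the origin and recognize both fields as
gradients. Thus,

$$ \mathbf{L} = \frac{qp}{4\pi c}\int d\mathbf{r}\;\mathbf{r}\times\left[\nabla\frac{1}{r}\times\nabla\frac{1}{|\mathbf{r}-\mathbf{r}_1|}\right]. $$

Integrate appropriately by parts (i.e., on one of the two gradients) and then
use the Legendre polynomial expansion to find the answer.
Note on dimensional grounds that $\mathbf{L}$ must be proportional to

$$ \frac{qp}{c}\,\frac{\mathbf{r}_1 - \mathbf{r}_2}{|\mathbf{r}_1 - \mathbf{r}_2|}. $$

(c) From the formula for $\mathbf{L}$ and the quantization of angular momentum
follows the quantization of electric charge, which must hold
if a magnetic pole exists. This semiclassical result also holds in
quantum theory.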


3.18 It is amusing that Maxwell’s equations take a particularly simple 
form in terms of the vector Q = E + iB. For the free field equation, 


$$ \frac{1}{c}\frac{\partial\mathbf{Q}}{\partial t} = -i\,\nabla\times\mathbf{Q}. $$


Verify this and, following the work of 3.17, find the current that 
goes on the right-hand side in the presence of magnetic poles. 
CHAPTER 4


Radiation by Prescribed 
Sources 


In this chapter we study the emission of radiation by a given source 
(charge and current densities). The complete solution of a physics problem 
will, in principle, require the simultaneous calculation of the effect of the 
radiation on the source. In many cases, however, the radiation reaction 
is relatively weak so that it can be taken into account in a series of 
successive approximations. Those are the cases we will be dealing with in 
this chapter. Chapter 5, on scattering, deals with the more general problem 
of interacting fields and sources. 


4.1. VECTOR AND SCALAR POTENTIALS 


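The equations for E and B can be solved most easily by introducing a
vector and scalar potential. Since $\nabla\cdot\mathbf{B} = 0$, there exists a vector potential
$\mathbf{A}$, such that $\mathbf{B} = \nabla\times\mathbf{A}$. The homogeneous equation

$$ \nabla\times\mathbf{E} + \frac{1}{c}\frac{\partial\mathbf{B}}{\partial t} = 0 \quad\text{or}\quad \nabla\times\left(\mathbf{E} + \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t}\right) = 0 \tag{4.1.1} $$

implies that

$$ \mathbf{E} + \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} = -\nabla\phi, \tag{4.1.2} $$

that is, that there exists a scalar potential $\phi$. Thus, E and B are given by
the equations

$$ \mathbf{B} = \nabla\times\mathbf{A} \tag{4.1.3} $$

and

$$ \mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} - \nabla\phi. \tag{4.1.4} $$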


We have seen in the quasistatic case that E and B do not uniquely
determine $\mathbf{A}$ and $\phi$. That is still true here. The substitutions

$$ \mathbf{A}' = \mathbf{A} + \nabla\chi \tag{4.1.5} $$

$$ \phi' = \phi - \frac{1}{c}\frac{\partial\chi}{\partial t} \tag{4.1.6} $$


evidently leave B and E unchanged and so correspond to the same physical
fields as $\mathbf{A}$ and $\phi$. This property is called gauge invariance. In classical
field theory the potentials are introduced as a calculational convenience,
and their gauge similarly chosen for convenience.¹
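
Gauge invariance can be checked numerically. The sketch below evaluates $\mathbf{E} = -(1/c)\,\partial\mathbf{A}/\partial t - \nabla\phi$ and $\mathbf{B} = \nabla\times\mathbf{A}$ by central differences, before and after a gauge substitution of the form (4.1.5) and (4.1.6). The potentials `A0`, `phi0` and the gauge function `chi` are arbitrary smooth functions invented for the illustration, and $c = 1$ is an assumed unit choice.

```python
import math

C = 1.0    # speed of light in arbitrary units (assumption)
H = 1e-4   # finite-difference step


def grad(f, r, t):
    """Central-difference gradient of a scalar field f(r, t)."""
    out = []
    for i in range(3):
        rp, rm = list(r), list(r)
        rp[i] += H
        rm[i] -= H
        out.append((f(rp, t) - f(rm, t)) / (2 * H))
    return out


def dt(F, r, t):
    """Central-difference time derivative of a vector field F(r, t)."""
    return [(p - m) / (2 * H) for p, m in zip(F(r, t + H), F(r, t - H))]


def curl(F, r, t):
    """Central-difference curl of a vector field F(r, t)."""
    def d(i, j):  # dF_i / dx_j
        rp, rm = list(r), list(r)
        rp[j] += H
        rm[j] -= H
        return (F(rp, t)[i] - F(rm, t)[i]) / (2 * H)
    return [d(2, 1) - d(1, 2), d(0, 2) - d(2, 0), d(1, 0) - d(0, 1)]


def fields(A, phi, r, t):
    """Return (E, B) with E = -(1/c) dA/dt - grad(phi), B = curl A."""
    E = [-a / C - g for a, g in zip(dt(A, r, t), grad(phi, r, t))]
    return E, curl(A, r, t)


# Hypothetical potentials and gauge function, chosen only for illustration.
def A0(r, t):
    x, y, z = r
    return [math.sin(y - t), x * z, math.cos(x + t)]


def phi0(r, t):
    x, y, z = r
    return x * y + math.sin(z - t)


def chi(r, t):
    x, y, z = r
    return math.sin(x) * math.cos(y) * z + t * x


def A1(r, t):    # A' = A + grad(chi)
    return [a + g for a, g in zip(A0(r, t), grad(chi, r, t))]


def phi1(r, t):  # phi' = phi - (1/c) d(chi)/dt
    return phi0(r, t) - (chi(r, t + H) - chi(r, t - H)) / (2 * H * C)


def max_field_difference(r, t):
    """Largest component difference between the fields in the two gauges."""
    E0, B0 = fields(A0, phi0, r, t)
    E1, B1 = fields(A1, phi1, r, t)
    return max(abs(a - b) for a, b in zip(E0 + B0, E1 + B1))
```

Calling `max_field_difference([0.3, -0.2, 0.5], 0.1)` should return a number at the level of floating-point roundoff, because the discrete gradient, curl, and time-derivative operators commute in the same way the continuous ones do.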

The inhomogeneous equations are now 


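$$ \nabla\cdot\mathbf{E} = -\frac{1}{c}\,\nabla\cdot\frac{\partial\mathbf{A}}{\partial t} - \nabla^2\phi = 4\pi\rho \tag{4.1.7} $$

and

$$ \nabla\times\mathbf{B} - \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} = \nabla\times(\nabla\times\mathbf{A}) + \frac{1}{c}\left(\frac{1}{c}\frac{\partial^2\mathbf{A}}{\partial t^2} + \nabla\frac{\partial\phi}{\partial t}\right) = \frac{4\pi}{c}\,\mathbf{j}. \tag{4.1.8} $$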


¹In quantum theory the potentials play an essential role.

There are two especially useful gauges to work in. The first is called
the transverse, or Coulomb gauge. It is defined by

$$ \nabla\cdot\mathbf{A} = 0 \tag{4.1.9} $$

and can clearly be reached by the proper choice of $\chi$ in (4.1.5), starting
from any $\mathbf{A}_0$, by solving the equation $\nabla\cdot\mathbf{A}_0 + \nabla^2\chi = 0$ for $\chi$. Equation
(4.1.7) then becomes

$$ \nabla^2\phi = -4\pi\rho, \tag{4.1.10} $$

the familiar Poisson equation, which produces a scalar potential

$$ \phi(\mathbf{r}, t) = \int d\mathbf{r}'\,\frac{\rho(\mathbf{r}', t)}{|\mathbf{r} - \mathbf{r}'|}, \tag{4.1.11} $$

which is instantaneously connected to the charge density. This solution
appears to violate causality (the finite propagation velocity of electromagnetic
signals) but does not, since the potential $\phi$ is not a physical field.
The test of causality must come in the field strengths E and B, where the
vector potential must make a contribution that will cancel the acausal
contribution of $\nabla\phi$. This comes about from (4.1.8) for $\mathbf{A}$:


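$$ \nabla\times(\nabla\times\mathbf{A}) + \frac{1}{c^2}\frac{\partial^2\mathbf{A}}{\partial t^2} = \frac{4\pi}{c}\,\mathbf{j} - \frac{1}{c}\,\nabla\frac{\partial\phi}{\partial t}, \tag{4.1.12} $$

with $\phi$ given by (4.1.11).
Note that (4.1.12) is purely transverse; that is, the divergence of both
sides is zero, since

$$ \nabla\cdot\mathbf{j} - \frac{1}{4\pi}\frac{\partial}{\partial t}\nabla^2\phi = -\frac{\partial\rho}{\partial t} - \frac{1}{4\pi}\frac{\partial}{\partial t}(-4\pi\rho) = 0. $$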


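We see that $(1/4\pi)\nabla(\partial\phi/\partial t)$ is the longitudinal part of $\mathbf{j}$, so that even with
$\mathbf{j}$ and $\rho$ spatially confined, the longitudinal part of $\mathbf{j}$ falls off only like an
inverse power of $r$ as $r \to \infty$. (See Problem 4.1.) The contribution of this
term to $\mathbf{A}$ and $\partial\mathbf{A}/\partial t$ at large $r$ will have to cancel the unwanted contribution
to E from $-\nabla\phi$. (See Problem 4.2.)

To show that this must happen, it is convenient to use the second
gauge, called the Lorentz gauge. To introduce the Lorentz gauge, we
rewrite (4.1.7) and (4.1.8) as

$$ -\nabla^2\phi + \frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2} + \frac{1}{c}\frac{\partial}{\partial t}\left(-\frac{1}{c}\frac{\partial\phi}{\partial t} - \nabla\cdot\mathbf{A}\right) = 4\pi\rho \tag{4.1.13} $$

and

$$ -\nabla^2\mathbf{A} + \frac{1}{c^2}\frac{\partial^2\mathbf{A}}{\partial t^2} + \nabla\left(\nabla\cdot\mathbf{A} + \frac{1}{c}\frac{\partial\phi}{\partial t}\right) = \frac{4\pi}{c}\,\mathbf{j}. \tag{4.1.14} $$

The obvious choice of gauge now is

$$ \nabla\cdot\mathbf{A} + \frac{1}{c}\frac{\partial\phi}{\partial t} = 0, \tag{4.1.15} $$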

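which decouples the different vectors from each other and allows the three
components of $\mathbf{A}$ and the one component of $\phi$ to be calculated as if they
were independent scalars. To see that this gauge can always be chosen,
start with $\mathbf{A}_0$, $\phi_0$ and solve the equation

$$ \nabla\cdot(\mathbf{A}_0 + \nabla\chi) + \frac{1}{c}\frac{\partial}{\partial t}\left(\phi_0 - \frac{1}{c}\frac{\partial\chi}{\partial t}\right) = 0 $$

or

$$ \left(\nabla^2 - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\chi = -\left(\nabla\cdot\mathbf{A}_0 + \frac{1}{c}\frac{\partial\phi_0}{\partial t}\right). \tag{4.1.16} $$

Equation (4.1.16) can be solved, in general, by iteration, given $\chi$ and
$\partial\chi/\partial t$ at $t = 0$. Of course, the solution is not unique, since one can still
add any solution of

$$ \left(\nabla^2 - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\chi = 0 $$

to a particular solution of (4.1.16).
We have now to solve the decoupled equations

$$ \nabla^2\mathbf{A} - \frac{1}{c^2}\frac{\partial^2\mathbf{A}}{\partial t^2} \equiv \Box\mathbf{A} = -\frac{4\pi}{c}\,\mathbf{j} \tag{4.1.17} $$

and

$$ \left(\nabla^2 - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\phi \equiv \Box\phi = -4\pi\rho. \tag{4.1.18} $$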


4.2. GREEN’S FUNCTIONS FOR THE RADIATION 
EQUATION 


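We will want a solution of the equation

$$ \Box\psi = -4\pi f(\mathbf{r}, t) \tag{4.2.1} $$

with $f$ a given function of space and time. The Green's function
$G(\mathbf{r}, t; \mathbf{r}', t')$ that gives

$$ \psi = \int d\mathbf{r}'\,dt'\,G(\mathbf{r}, t; \mathbf{r}', t')\,f(\mathbf{r}', t') \tag{4.2.2} $$

must satisfy the equation

$$ \Box G = \left(\nabla^2 - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)G(\mathbf{r}, t; \mathbf{r}', t') = -4\pi\,\delta^3(\mathbf{r} - \mathbf{r}')\,\delta(t - t'). \tag{4.2.3} $$

We look for a solution that is a function only of $\mathbf{r} - \mathbf{r}'$ and $t - t'$, which
must satisfy the equation

$$ \Box G(\mathbf{r}, t) = -4\pi\,\delta^3(\mathbf{r})\,\delta(t). \tag{4.2.4} $$

Equation (4.2.4) invites a Fourier transform:

$$ G(\mathbf{r}, t) = \int d\mathbf{k}\,d\omega\,e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}\,g(\mathbf{k}, \omega), \tag{4.2.5} $$

which, with

$$ \delta^3(\mathbf{r})\,\delta(t) = \int\frac{d\mathbf{k}\,d\omega}{(2\pi)^4}\,e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}, $$

gives

$$ \left(\frac{\omega^2}{c^2} - k^2\right)g = -\frac{4\pi}{(2\pi)^4} \tag{4.2.6} $$

or

$$ g = -\frac{1}{4\pi^3}\,\frac{1}{\dfrac{\omega^2}{c^2} - k^2}, \tag{4.2.7} $$

so that

$$ G(\mathbf{r}, t) = -\frac{c^2}{4\pi^3}\int d\mathbf{k}\,e^{i\mathbf{k}\cdot\mathbf{r}}\int d\omega\,\frac{e^{-i\omega t}}{\omega^2 - c^2k^2}. \tag{4.2.8} $$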


We have seen denominators of this type before in connection with
the causality of atomic polarizability; see (3.4.24)-(3.4.30). Recall that
$G(\mathbf{r}, t)$ will vanish for negative $t$ if we take $\omega$ in the denominator of (4.2.8)
to approach the real axis from above. That definition corresponds to a
retarded potential. That is, the source of (4.2.8) is a $\delta$-function pulse at
$t = 0$ and $\mathbf{r} = 0$. Requiring $G$ to vanish for $t < 0$ is equivalent to requiring
that the radiation from the source pulse come after the pulse. Thus, we
have the retarded Green's function

$$ G_R(\mathbf{r}, t) = -\lim_{\epsilon\to 0}\frac{c^2}{4\pi^3}\int d\mathbf{k}\,e^{i\mathbf{k}\cdot\mathbf{r}}\int_{-\infty}^{\infty}d\omega\,\frac{e^{-i\omega t}}{(\omega + i\epsilon)^2 - c^2k^2}; \tag{4.2.9} $$

the advanced Green's function (corresponding to a time-reversed world)
is

$$ G_A(\mathbf{r}, t) = -\lim_{\epsilon\to 0}\frac{c^2}{4\pi^3}\int d\mathbf{k}\,e^{i\mathbf{k}\cdot\mathbf{r}}\int_{-\infty}^{\infty}d\omega\,\frac{e^{-i\omega t}}{(\omega - i\epsilon)^2 - c^2k^2}. \tag{4.2.10} $$

To calculate $G_R$ for $t > 0$, we must close the $\omega$ integral (4.2.9) in the
lower half $\omega$ plane, where there are poles at $\omega = -i\epsilon \pm ck$. The result is

$$ \lim_{\epsilon\to 0}\int_{-\infty}^{\infty}d\omega\,\frac{e^{-i\omega t}}{(\omega + i\epsilon)^2 - c^2k^2} = \begin{cases} -\dfrac{2\pi}{ck}\,\sin ckt, & t > 0 \\[1ex] 0, & t < 0 \end{cases} $$

and so

$$ G_R = \frac{c}{2\pi^2}\int\frac{d\mathbf{k}}{k}\,e^{i\mathbf{k}\cdot\mathbf{r}}\,\sin ckt, \tag{4.2.11} $$

always for $t > 0$.
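The integral over angles of $\mathbf{k}$ gives

$$ G_R = \frac{2c}{\pi r}\int_0^\infty dk\,\sin kr\,\sin ckt \tag{4.2.12} $$

$$ = \frac{c}{\pi r}\int_0^\infty dk\,[\cos k(r - ct) - \cos k(r + ct)] $$

$$ = \frac{c}{2\pi r}\int_{-\infty}^{\infty}dk\,\left[e^{ik(r - ct)} - e^{ik(r + ct)}\right] $$

$$ = \frac{c}{r}\,\big(\delta(r - ct) - \delta(r + ct)\big). \tag{4.2.13} $$

The second term is zero (both $r$ and $t$ are positive) so the final result is

$$ G_R(\mathbf{r}, t) = \frac{c}{r}\,\delta(r - ct). \tag{4.2.14} $$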
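
The $\omega$ integral that leads to (4.2.11) can be checked numerically at a small but finite $\epsilon$. Evaluating the residues at $\omega = -i\epsilon \pm ck$ without taking the limit gives $-(2\pi/ck)\,e^{-\epsilon t}\sin ckt$ for $t > 0$ and zero for $t < 0$. The sketch below compares a plain trapezoid sum with that closed form; the function names, the value of $\epsilon$, and the integration window are assumptions made for the illustration.

```python
import cmath
import math


def omega_integral(t, ck=1.0, eps=0.1, half_width=200.0, step=0.01):
    """Trapezoid estimate of the integral over real omega of
    exp(-i*omega*t) / ((omega + i*eps)**2 - ck**2)."""
    n = int(round(2 * half_width / step))
    total = 0.0 + 0.0j
    for i in range(n + 1):
        w = -half_width + i * step
        f = cmath.exp(-1j * w * t) / ((w + 1j * eps) ** 2 - ck ** 2)
        total += f if 0 < i < n else 0.5 * f
    return total * step


def residue_result(t, ck=1.0, eps=0.1):
    """Closed form at finite eps: -(2*pi/ck)*exp(-eps*t)*sin(ck*t) for t > 0, else 0."""
    if t <= 0:
        return 0.0
    return -(2 * math.pi / ck) * math.exp(-eps * t) * math.sin(ck * t)


if __name__ == "__main__":
    for t in (1.3, -1.3):
        print(t, omega_integral(t), residue_result(t))
```

The vanishing of the integral for negative $t$ is the numerical counterpart of the statement that the retarded Green's function responds only after the source pulse.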


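Having found the solution, we can verify directly that (4.2.14) satisfies
(4.2.4). (See Problem 4.9.)

Returning to the original equation (4.2.1), we find the retarded solution

$$ \psi(\mathbf{r}, t) = \int d\mathbf{r}'\,dt'\,\frac{1}{|\mathbf{r} - \mathbf{r}'|}\,\delta\!\left(t' - t + \frac{|\mathbf{r} - \mathbf{r}'|}{c}\right)f(\mathbf{r}', t') \tag{4.2.15} $$

or

$$ \psi(\mathbf{r}, t) = \int d\mathbf{r}'\,\frac{f(\mathbf{r}', t_R)}{|\mathbf{r} - \mathbf{r}'|}, \tag{4.2.16} $$

where the retarded time $t_R = t - |\mathbf{r} - \mathbf{r}'|/c$ is the time at which a signal
must be emitted at $\mathbf{r}'$ to reach $\mathbf{r}$ at the time $t$.

A useful way to picture the content of (4.2.16) is to imagine a
spherical wave converging at time $t$ on the point $\mathbf{r}$. The retarded time $t_R$
is the time the wave crosses the point $\mathbf{r}'$, picking up its contribution from
the source $f(\mathbf{r}', t_R)$ as it does so.

The fields $\mathbf{A}$ and $\phi$ radiated by the current and charge density $\mathbf{j}$ and
$\rho$ as described in (4.1.17) and (4.1.18) are now given by (4.2.16):

$$ \mathbf{A}(\mathbf{r}, t) = \frac{1}{c}\int d\mathbf{r}'\,\frac{\mathbf{j}(\mathbf{r}', t_R)}{|\mathbf{r} - \mathbf{r}'|} \tag{4.2.17} $$

and

$$ \phi(\mathbf{r}, t) = \int d\mathbf{r}'\,\frac{\rho(\mathbf{r}', t_R)}{|\mathbf{r} - \mathbf{r}'|}. \tag{4.2.18} $$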


In this gauge, the absence of the instantaneous electric field encoun- 
tered in (4.1.11) is obvious, since all the potentials and fields are retarded. 


4.3. RADIATION FROM A FIXED FREQUENCY 
SOURCE 


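We suppose $\mathbf{j}$ and $\rho$ are given by

$$ \mathbf{j}(\mathbf{r}, t) = \mathbf{j}_0(\mathbf{r})\,e^{-i\omega t} \tag{4.3.1} $$

and

$$ \rho(\mathbf{r}, t) = \rho_0(\mathbf{r})\,e^{-i\omega t}, \tag{4.3.2} $$

with

$$ \nabla\cdot\mathbf{j}_0 - i\omega\rho_0 = 0. \tag{4.3.3} $$

We calculate the potentials as

$$ \mathbf{A}(\mathbf{r}, t) = \frac{1}{c}\int d\mathbf{r}'\,\frac{\mathbf{j}_0(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\,e^{-i\omega(t - |\mathbf{r} - \mathbf{r}'|/c)} \tag{4.3.4} $$

and

$$ \phi(\mathbf{r}, t) = \int d\mathbf{r}'\,\frac{\rho_0(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\,e^{-i\omega(t - |\mathbf{r} - \mathbf{r}'|/c)}. \tag{4.3.5} $$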


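The signature of radiation is that, for sufficiently large $r$, E and B go
like $1/r$ so that the energy radiated through a solid angle $d\Omega$,

$$ d\mathbf{S}\cdot\mathbf{P} = \frac{c}{4\pi}\,\mathbf{E}\times\mathbf{B}\cdot d\mathbf{S} = \frac{c}{4\pi}\,\mathbf{E}\times\mathbf{B}\cdot\hat{\mathbf{r}}\,r^2\,d\Omega, $$

goes like a constant (independent of $r$). The region where the $1/r$ behavior
dominates the fields is called the wave zone. The radiated power per unit
solid angle is given by the coefficient of $1/r^2$ in the Poynting vector, so
that the calculation of radiation requires only the leading term in $1/r$ in
(4.3.4) and (4.3.5). Thus,

$$ \frac{1}{|\mathbf{r} - \mathbf{r}'|} = \frac{1}{r} + O\!\left(\frac{1}{r^2}\right) $$

and

$$ |\mathbf{r} - \mathbf{r}'| = r - \hat{\mathbf{r}}\cdot\mathbf{r}' + O\!\left(\frac{r'^2}{r}\right). \tag{4.3.6} $$

The last term is negligible in $e^{i\omega|\mathbf{r} - \mathbf{r}'|/c}$, provided the wavelength $\lambda$ is such
that

$$ \frac{r'^2}{\lambda r} \ll 1. \tag{4.3.7} $$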


Note that a macroscopic source could require kilometers to reach the 
wave zone as defined above. Clearly, the method we are discussing here 
does not apply to such a case. 
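
The remark about macroscopic sources can be made concrete with a one-line estimate. The sketch below assumes that the wave-zone criterion reads $r \gg r'^2/\lambda$ (an assumed reading of (4.3.7), with numerical factors of order one ignored); the function name and the sample sizes are hypothetical.

```python
def wave_zone_distance(source_size: float, wavelength: float) -> float:
    """Distance scale r'^2 / lambda beyond which the quadratic phase term is small.

    Both arguments must be given in the same length unit.
    """
    return source_size ** 2 / wavelength


if __name__ == "__main__":
    # Hypothetical examples, lengths in metres, visible light (5e-7 m).
    print(wave_zone_distance(1.0, 5.0e-7))       # 1 m source
    print(wave_zone_distance(1.0e-10, 5.0e-7))   # atom-sized source
```

For the one-metre source the estimate comes out in the thousands of kilometres, while for an atom-sized source it is far smaller than the wavelength itself, so the wave zone is reached essentially at once.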

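There remain, as $r \to \infty$,

$$ \mathbf{A} = \frac{e^{-i\omega(t - r/c)}}{cr}\int d\mathbf{r}'\,\exp\!\left(-i\frac{\omega}{c}\,\hat{\mathbf{r}}\cdot\mathbf{r}'\right)\mathbf{j}_0(\mathbf{r}') \tag{4.3.8} $$

and

$$ \phi = \frac{e^{-i\omega(t - r/c)}}{r}\int d\mathbf{r}'\,\exp\!\left(-i\frac{\omega}{c}\,\hat{\mathbf{r}}\cdot\mathbf{r}'\right)\rho_0(\mathbf{r}'). \tag{4.3.9} $$

We recognize that $\mathbf{k} = \omega\hat{\mathbf{r}}/c$ is the radiated wave number. In differentiation,
$(\partial/\partial x_i)\hat{\mathbf{r}}$ goes like $1/r$, so that $\mathbf{k}$ can be treated as a constant. Thus,

$$ \mathbf{A} = \frac{e^{i(kr - \omega t)}}{cr}\,\mathbf{j}_k \tag{4.3.10} $$

and

$$ \phi = \frac{e^{i(kr - \omega t)}}{r}\,\rho_k, \tag{4.3.11} $$

where

$$ \mathbf{j}_k = \int d\mathbf{r}'\,e^{-i\mathbf{k}\cdot\mathbf{r}'}\,\mathbf{j}_0(\mathbf{r}') \tag{4.3.12} $$

and

$$ \rho_k = \int d\mathbf{r}'\,e^{-i\mathbf{k}\cdot\mathbf{r}'}\,\rho_0(\mathbf{r}'). \tag{4.3.13} $$

Equation (4.3.3) leads to

$$ \mathbf{k}\cdot\mathbf{j}_k = \omega\rho_k. \tag{4.3.14} $$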

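We find the fields from $\mathbf{A}$ and $\phi$. To leading order in $1/r$,

$$ \frac{\partial\mathbf{A}}{\partial x_i} = \frac{e^{i(kr - \omega t)}}{cr}\,\mathbf{j}_k\,\frac{ikx_i}{r} \tag{4.3.15} $$

and

$$ \frac{\partial\phi}{\partial x_i} = \frac{e^{i(kr - \omega t)}}{r}\,\rho_k\,\frac{ikx_i}{r}, \tag{4.3.16} $$

so that

$$ \mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} - \nabla\phi = \frac{i\omega}{c}\,\frac{e^{i(kr - \omega t)}}{cr}\,\mathbf{j}_k - i\mathbf{k}\,\rho_k\,\frac{e^{i(kr - \omega t)}}{r} \tag{4.3.17} $$

and

$$ \mathbf{B} = \nabla\times\mathbf{A} = i\mathbf{k}\times\mathbf{j}_k\,\frac{e^{i(kr - \omega t)}}{cr}. \tag{4.3.18} $$

Observe that both B and E as given in (4.3.17) and (4.3.18) are transverse,
B obviously and E from (4.3.14). The second term in (4.3.17) serves
merely to cancel the longitudinal part of $\mathbf{j}_k$, leaving only the transverse
part,

$$ \mathbf{j}_{kT} = \mathbf{j}_k - \frac{\mathbf{k}\,\mathbf{k}\cdot\mathbf{j}_k}{k^2}, \tag{4.3.19} $$

which of course would have appeared more naturally in the transverse
gauge. Thus, the E and B fields have the properties of propagating fields
with polarization vector $\mathbf{j}_{kT}$, wave number $\mathbf{k}$, and satisfying the correct
right-hand rule relating $\mathbf{k}$, E, and B.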

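We calculate the radiated energy [averaged over a cycle as described
in (3.5.19)] as

$$ \frac{dW}{dt\,d\Omega} = \overline{\mathbf{P}}\cdot\hat{\mathbf{r}}\,r^2 \tag{4.3.20} $$

$$ = \frac{c}{4\pi}\,\overline{\mathbf{E}\times\mathbf{B}}\cdot\hat{\mathbf{r}}\,r^2 = \frac{c}{8\pi}\,\mathrm{Re}\,(\mathbf{E}^*\times\mathbf{B})\cdot\hat{\mathbf{r}}\,r^2 \tag{4.3.21} $$

$$ = \frac{\omega k}{8\pi c^2}\,\mathbf{j}_{kT}^*\cdot\mathbf{j}_{kT} \tag{4.3.22} $$

and

$$ \mathbf{j}_{kT}^*\cdot\mathbf{j}_{kT} = \mathbf{j}_k^*\cdot\mathbf{j}_k - \frac{(\mathbf{j}_k^*\cdot\mathbf{k})(\mathbf{j}_k\cdot\mathbf{k})}{k^2}. \tag{4.3.23} $$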


These formulas are very general and are correct to order 1/r. In the 
next two sections, we will apply this general theory to specific cases. 
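
The transverse projection and the identity used for the radiated power can be checked with a few lines of arithmetic. The sketch below assumes the projection has the form $\mathbf{j}_{kT} = \mathbf{j}_k - \mathbf{k}(\mathbf{k}\cdot\mathbf{j}_k)/k^2$ with real $\mathbf{k}$ and complex $\mathbf{j}_k$; the helper names and the sample vectors are invented for the illustration.

```python
def dot(a, b):
    """Bilinear dot product (no complex conjugation)."""
    return sum(x * y for x, y in zip(a, b))


def transverse_part(j, k):
    """Return j_T = j - k (k . j) / k^2 for complex j and real k."""
    k2 = dot(k, k)
    kj = dot(k, j)
    return [ji - ki * kj / k2 for ji, ki in zip(j, k)]


def norm_sq(a):
    """Hermitian norm a* . a."""
    return sum((x.conjugate() * x).real for x in a)


def norm_sq_via_identity(j, k):
    """j* . j - (j* . k)(j . k) / k^2, the right-hand side of the identity."""
    jk = dot(j, k)
    return norm_sq(j) - (jk.conjugate() * jk).real / dot(k, k)


if __name__ == "__main__":
    j = [1 + 2j, -0.5j, 0.3 + 0j]   # hypothetical complex amplitude
    k = [0.2, -1.0, 0.7]            # hypothetical real wave vector
    jT = transverse_part(j, k)
    print(abs(dot(k, jT)))                            # transversality check
    print(norm_sq(jT), norm_sq_via_identity(j, k))    # two sides of the identity
```

Only the component of the current amplitude perpendicular to the direction of observation contributes to the radiated power, which is what the projection expresses.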


4.4. RADIATION BY A SLOWLY MOVING POINT 
PARTICLE 


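We consider the radiation from a slowly moving point particle, with
current density

$$ \mathbf{j}(\mathbf{r}, t) = q\,\mathbf{v}(t)\,\delta^3(\mathbf{r} - \mathbf{r}(t)), \tag{4.4.1} $$

where $\mathbf{r}(t)$ and $\mathbf{v}(t)$ are the position and velocity of the particle at time $t$,
and $q$ is the particle's electric charge. Slow means $v/c \ll 1$.
To find the vector potential produced by $\mathbf{j}$, we Fourier transform

$$ \mathbf{j}(\mathbf{r}, t) = \int d\omega\,e^{-i\omega t}\,\mathbf{j}_\omega(\mathbf{r}) \tag{4.4.2} $$

[$\mathbf{j}_\omega$ here denotes the object corresponding to $\mathbf{j}_0(\mathbf{r})$ in Section 4.3]. Equation
(4.3.8) then gives $\mathbf{A}$ in the radiation zone:

$$ \mathbf{A}(\mathbf{r}, t) = \int d\omega\,\frac{e^{-i\omega(t - r/c)}}{cr}\int d\mathbf{r}'\,e^{-i\mathbf{k}\cdot\mathbf{r}'}\,\mathbf{j}_\omega(\mathbf{r}'), \tag{4.4.3} $$

where $\mathbf{k} = (\omega/c)\hat{\mathbf{r}}$.

The exponential $e^{-i\mathbf{k}\cdot\mathbf{r}'} = e^{-i(\omega/c)\hat{\mathbf{r}}\cdot\mathbf{r}'}$ is approximately constant for a
slowly moving particle, since $\Delta(r'\omega/c) \sim v/c$. If we make the obvious
choice of origin, (4.4.3) becomes

$$ \mathbf{A}(\mathbf{r}, t) = \frac{1}{rc}\int d\omega\,e^{-i\omega(t - r/c)}\int d\mathbf{r}'\,\mathbf{j}_\omega(\mathbf{r}') = \frac{1}{rc}\int d\mathbf{r}'\,\mathbf{j}\!\left(\mathbf{r}', t - \frac{r}{c}\right), \tag{4.4.4} $$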


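which with (4.4.1) gives

$$ \mathbf{A}(\mathbf{r}, t) = \frac{q}{rc}\,\mathbf{v}\!\left(t - \frac{r}{c}\right) + O\!\left(\frac{1}{r^2}\right). \tag{4.4.5} $$

Thus, $\mathbf{A}$ is given by the radiating particle's velocity at its retarded time.
We immediately find

$$ \mathbf{E} = -\frac{q}{rc^2}\,\mathbf{a}_T\!\left(t - \frac{r}{c}\right) + O\!\left(\frac{1}{r^2}\right), \tag{4.4.6} $$

where

$$ \mathbf{a}_T = \mathbf{a} - \hat{\mathbf{r}}\,\hat{\mathbf{r}}\cdot\mathbf{a} \tag{4.4.7} $$

is the transverse component of $\mathbf{a}$, the acceleration at the retarded time.
Remember that $\hat{\mathbf{r}}$ is the direction of observation.
The magnetic field is

$$ \mathbf{B} = \nabla\times\mathbf{A} = \frac{q}{rc}\left(-\frac{\hat{\mathbf{r}}}{c}\right)\times\mathbf{a} = \hat{\mathbf{r}}\times\mathbf{E} + O\!\left(\frac{1}{r^2}\right). \tag{4.4.8} $$

The instantaneous Poynting vector is

$$ \mathbf{P} = \frac{c}{4\pi}\,\mathbf{E}\times\mathbf{B} = \frac{q^2}{4\pi r^2c^3}\left|\mathbf{a}_T\!\left(t - \frac{r}{c}\right)\right|^2\hat{\mathbf{r}} = \frac{q^2}{4\pi r^2c^3}\left[a^2 - (\mathbf{a}\cdot\hat{\mathbf{r}})^2\right]\hat{\mathbf{r}}, \tag{4.4.9} $$

where $\mathbf{a}$ is evaluated at the retarded time $t - r/c$.
If we average over a time $T$ (e.g., a cycle of simple harmonic motion),
we have for the average rate of radiation per unit area and time

$$ \overline{\mathbf{P}} = \frac{q^2}{4\pi r^2c^3}\,\overline{\left[a^2 - (\mathbf{a}\cdot\hat{\mathbf{r}})^2\right]}\,\hat{\mathbf{r}}, \tag{4.4.10} $$

in agreement with (4.3.22), recognizing that $q^2\,\overline{a^2} = \omega^2|\mathbf{j}_k|^2/2$ for simple
harmonic motion and $k = 0$. Equation (4.4.10) tells us that the time-average
power radiated per unit solid angle, $dP/d\Omega$, is independent of $r$:

$$ \frac{dP}{d\Omega} = \frac{1}{d\Omega}\int_{\text{over } d\Omega} d\mathbf{S}\cdot\overline{\mathbf{P}} = \frac{q^2}{4\pi c^3}\,\overline{\left(a^2 - (\mathbf{a}\cdot\hat{\mathbf{r}})^2\right)}. \tag{4.4.11} $$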


Note that in this approximation (called electric dipole), plane polarized
light is produced by a linear particle trajectory, the plane of polarization
being the plane containing the particle motion $\mathbf{r}(t)$ and the direction
of observation, $\hat{\mathbf{r}} = \hat{\mathbf{k}}$. With any other than a linear trajectory, the nature of
the polarization depends on the direction $\hat{\mathbf{r}}$ of observation. Except for
special directions, the polarization is, in general, elliptic.

The dependence on the direction of observation of the intensity of
radiation is given by (4.4.10). For a linear trajectory, it is $\sin^2\theta$, where $\theta$
is the angle between the trajectory direction and the direction of
observation. The total radiated energy per unit time (power) is given by
the integral of $\overline{\mathbf{P}}$ over a distant surface

$$ \frac{dW}{dt} = \int_S d\mathbf{S}\cdot\overline{\mathbf{P}} = \int\frac{q^2\,\overline{a^2}}{4\pi c^3}\,d\Omega\,\sin^2\theta = \frac{2}{3}\,\frac{q^2\,\overline{a^2}}{c^3}. \tag{4.4.12} $$
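
The angular integral behind the total power is easy to confirm numerically: $\int \sin^2\theta\,d\Omega = 8\pi/3$, which turns the prefactor $1/4\pi$ of the angular distribution into $2/3$. The sketch below uses a midpoint rule; the function names and the unit choices in the example are assumptions.

```python
import math


def sin2_solid_angle_integral(n_theta: int = 2000) -> float:
    """Midpoint-rule estimate of the integral of sin^2(theta) over all solid angle."""
    d_theta = math.pi / n_theta
    total = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        # d(Omega) = 2*pi*sin(theta) d(theta) after the azimuthal integral
        total += math.sin(theta) ** 2 * 2.0 * math.pi * math.sin(theta) * d_theta
    return total


def larmor_power(q: float, a_sq_mean: float, c: float) -> float:
    """Total radiated power (2/3) q^2 <a^2> / c^3 in Gaussian units."""
    return 2.0 * q ** 2 * a_sq_mean / (3.0 * c ** 3)


def power_from_angular_distribution(q: float, a_sq_mean: float, c: float) -> float:
    """Integrate dP/dOmega = q^2 <a^2> sin^2(theta) / (4 pi c^3) numerically."""
    return q ** 2 * a_sq_mean / (4.0 * math.pi * c ** 3) * sin2_solid_angle_integral()


if __name__ == "__main__":
    print(sin2_solid_angle_integral(), 8.0 * math.pi / 3.0)
    print(larmor_power(1.0, 1.0, 1.0), power_from_angular_distribution(1.0, 1.0, 1.0))
```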


4.5. ELECTRIC AND MAGNETIC DIPOLE AND 
ELECTRIC QUADRUPOLE RADIATION 


The approximation made in (4.4.4), $\mathbf{k}\cdot\mathbf{r}' \ll 1$, can be applied to a more
general current distribution than the one given in (4.4.1). It forms the basis 
of a multipole expansion analogous to the electrostatic and magnetostatic 
multipole expansions discussed earlier in Chapters 1 and 2. For small kr’, 
that is, for dimensions of the radiating system much smaller than the 
radiated wavelength, the first few nonvanishing terms in the expansion 
provide a good approximation to the radiation amplitude j, of (4.3.12). 
For kr’ not small, the multipole expansion can still be carried out, as we 
shall see in Chapter 5, but is not equivalent to an expansion in kr’, and 
may converge slowly. For the moment, we confine ourselves to the first 
few terms, which we evaluate by expanding in kr’. 

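We consider, then, a given frequency $\omega$ and wave number
$\mathbf{k} = \hat{\mathbf{r}}(\omega/c)$ and evaluate the radiated amplitude of (4.3.12)

$$ \mathbf{j}_k = \int d\mathbf{r}'\,e^{-i\mathbf{k}\cdot\mathbf{r}'}\,\mathbf{j}_0(\mathbf{r}'), \tag{4.3.12} $$

where from (4.3.3)

$$ \nabla'\cdot\mathbf{j}_0(\mathbf{r}') = i\omega\rho_0(\mathbf{r}'). \tag{4.3.3} $$

We proceed by expanding the exponential

$$ e^{-i\mathbf{k}\cdot\mathbf{r}'} = 1 - i\mathbf{k}\cdot\mathbf{r}' + \cdots. \tag{4.5.1} $$

The first term gives

$$ \mathbf{j}_k^{(1)} = \int d\mathbf{r}'\,\mathbf{j}_0(\mathbf{r}'). \tag{4.5.2} $$

For a confined current,

$$ \mathbf{j}_k^{(1)} = -\int d\mathbf{r}'\,\mathbf{r}'\,\nabla'\cdot\mathbf{j}_0(\mathbf{r}') \tag{4.5.3} $$

or

$$ \mathbf{j}_k^{(1)} = -i\omega\int d\mathbf{r}'\,\rho_0(\mathbf{r}')\,\mathbf{r}'. \tag{4.5.4} $$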


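The question of origin of coordinates does not enter into (4.5.4), since
$\int\rho(\mathbf{r}')\,d\mathbf{r}'$ is conserved and, hence, has no component with $\omega$ different
from zero. The term “electric dipole” is now clear: the electric dipole
moment of the charge distribution $\rho_0$ is given by

$$ \mathbf{p}_E = \int d\mathbf{r}'\,\rho_0(\mathbf{r}')\,\mathbf{r}' \tag{4.5.5} $$

and so

$$ \mathbf{j}_k^{(1)} = -i\omega\,\mathbf{p}_E, \tag{4.5.6} $$

independent of the direction of radiation.
The complex electric field vector is given by (4.3.17) and (4.3.19) as

$$ \mathbf{E}^{(1)} = \frac{i\omega}{c^2r}\,e^{i(kr - \omega t)}\,(-i\omega)\,(\mathbf{p}_E - \hat{\mathbf{r}}\,\hat{\mathbf{r}}\cdot\mathbf{p}_E), \tag{4.5.7} $$

and the magnetic field as $\hat{\mathbf{r}}\times\mathbf{E}$:

$$ \mathbf{B}^{(1)} = \hat{\mathbf{r}}\times\mathbf{E}^{(1)}. \tag{4.5.8} $$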


The angular distribution of the radiation is given by (4.3.22) and
(4.3.23) as

$$ \frac{dW}{dt\,d\Omega} = \frac{\omega^4}{8\pi c^3}\left(|\mathbf{p}_E|^2 - \mathbf{p}_E^*\cdot\hat{\mathbf{r}}\;\hat{\mathbf{r}}\cdot\mathbf{p}_E\right). \tag{4.5.9} $$

We make contact with the formula (4.4.11) for the radiating point
particle by recognizing that for motion described by

$$ \mathbf{r}(t) = \mathrm{Re}\,(\mathbf{r}_0\,e^{-i\omega t}) \tag{4.5.10} $$

the acceleration $\mathbf{a}$ will be given by

$$ \mathbf{a}(t) = -\omega^2\,\mathrm{Re}\,(\mathbf{r}_0\,e^{-i\omega t}) $$

and the complex electric dipole moment corresponding to (4.5.5) by

$$ \mathbf{p}_E = q\,\mathbf{r}_0. \tag{4.5.11} $$

These connections lead back to (4.4.11). The apparent factor of two
difference comes about because $\overline{a^2} = \tfrac{1}{2}\,\omega^4|\mathbf{r}_0|^2$.

Concluding, we see that for a system with small $kr'$, radiation emission
will be largely determined by the system's electric dipole moment $\mathbf{p}_E$. $\mathbf{p}_E$
is a complex vector, independent of the details of the charge distribution;
for example, as we have seen, it does not distinguish between a moving
point charge and an oscillating continuum charge distribution $\rho$.
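
The factor of one half in the time average of $a^2$ can be confirmed by sampling one period of the motion $\mathbf{r}(t) = \mathrm{Re}(\mathbf{r}_0 e^{-i\omega t})$. The sketch below is illustrative; the function names and the sample amplitude (an elliptical orbit) are assumptions.

```python
import cmath
import math


def mean_square_acceleration(r0, omega, samples=1000):
    """Average |a(t)|^2 over one period for r(t) = Re(r0 * exp(-i*omega*t))."""
    period = 2.0 * math.pi / omega
    total = 0.0
    for n in range(samples):
        t = period * n / samples
        phase = cmath.exp(-1j * omega * t)
        a = [(-omega ** 2 * x * phase).real for x in r0]
        total += sum(c * c for c in a)
    return total / samples


def half_omega4_r0_sq(r0, omega):
    """The closed form (1/2) * omega^4 * |r0|^2."""
    return 0.5 * omega ** 4 * sum(abs(x) ** 2 for x in r0)


if __name__ == "__main__":
    r0 = [1.0 + 0j, 0.5j, 0j]   # hypothetical complex amplitude
    print(mean_square_acceleration(r0, 2.0), half_omega4_r0_sq(r0, 2.0))
```

The same factor appears whenever a real, harmonically varying quantity is squared and averaged, which is why the complex-amplitude formulas carry $1/8\pi$ where the instantaneous ones carry $1/4\pi$.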

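We turn next to the second term in the expansion (4.5.1), giving for
the next approximation to $\mathbf{j}_k$,

$$ \mathbf{j}_k^{(2)} = -i\int d\mathbf{r}'\,\mathbf{k}\cdot\mathbf{r}'\,\mathbf{j}_0(\mathbf{r}'). \tag{4.5.12} $$

Normally, $\mathbf{j}_k^{(2)}$ will be smaller than $\mathbf{j}_k^{(1)}$ by $kr' \sim v/c$; the exception is usually
when the electric dipole moment vanishes for reasons of symmetry. In
atomic and nuclear physics, for example, this happens for transitions with
$\Delta J > 1$, or $\Delta J = 1$ and no change of parity.

We manipulate (4.5.12) in a familiar way. We write

$$ -i\mathbf{k}\cdot\mathbf{r}'\,j_{0\ell}(\mathbf{r}') = -ik_jx'_j\,j_{0\ell}(\mathbf{r}') = -ik_j\left[\frac{x'_j\,j_{0\ell} - x'_\ell\,j_{0j}}{2} + \frac{x'_j\,j_{0\ell} + x'_\ell\,j_{0j}}{2}\right], $$

so

$$ (\mathbf{j}_k^{(2)})_\ell = \int d\mathbf{r}'\,(-i\mathbf{k}\cdot\mathbf{r}')\,j_{0\ell}(\mathbf{r}') = \frac{i}{2}\left[\mathbf{k}\times\int d\mathbf{r}'\,\mathbf{r}'\times\mathbf{j}_0\right]_\ell + \frac{ik_j}{2}\int d\mathbf{r}'\,x'_jx'_\ell\,\nabla'\cdot\mathbf{j}_0 $$

or

$$ (\mathbf{j}_k^{(2)})_\ell = i\,(c\,\mathbf{k}\times\mathbf{p}_M)_\ell - \frac{\omega k_j}{2}\int d\mathbf{r}'\,x'_jx'_\ell\,\rho_0(\mathbf{r}'). \tag{4.5.13} $$

In (4.5.13), $\mathbf{p}_M$ is the recognizable magnetic dipole moment

$$ \mathbf{p}_M = \frac{1}{2c}\int d\mathbf{r}'\,\mathbf{r}'\times\mathbf{j}_0(\mathbf{r}'); \tag{4.5.14} $$

the integral $\tfrac{1}{2}\int d\mathbf{r}'\,x'_jx'_\ell\,\rho_0(\mathbf{r}')$ can be replaced by the traceless quadrupole
tensor

$$ Q_{j\ell} = \frac{1}{2}\int d\mathbf{r}'\,\rho_0(\mathbf{r}')\left(x'_jx'_\ell - \frac{1}{3}\,\delta_{j\ell}\,r'^2\right), \tag{4.5.15} $$

since $\delta_{j\ell}$ inserted into (4.5.13) will produce a longitudinal contribution to
$\mathbf{j}$ and, hence, will not contribute to radiation.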
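We compare the form of magnetic dipole radiation with that of electric
dipole radiation.
The electric vectors ($\mathbf{j}_{kT}$) are given for the two cases by

$$ \mathbf{E}^{(1)} \propto \mathbf{j}_{kT}^{(1)} = -i\omega\,(\mathbf{p}_E - \mathbf{p}_E\cdot\hat{\mathbf{r}}\,\hat{\mathbf{r}}) \tag{4.5.16} $$

and

$$ \mathbf{E}^{(2)} \propto \mathbf{j}_{kT}^{(2)} = i\omega\,\hat{\mathbf{r}}\times\mathbf{p}_M. \tag{4.5.17} $$

The corresponding magnetic vectors are given by $\mathbf{B} = \hat{\mathbf{r}}\times\mathbf{E}$ or

$$ \mathbf{B}^{(1)} \propto -i\omega\,\hat{\mathbf{r}}\times\mathbf{p}_E \tag{4.5.18} $$

and

$$ \mathbf{B}^{(2)} \propto -i\omega\,(\mathbf{p}_M - \hat{\mathbf{r}}\,\hat{\mathbf{r}}\cdot\mathbf{p}_M). \tag{4.5.19} $$

Thus, the transformation from electric radiation to magnetic radiation
is

$$ \mathbf{p}_E \to \mathbf{p}_M, \qquad \mathbf{E} \to \mathbf{B}, \qquad \mathbf{B} \to -\mathbf{E}. \tag{4.5.20} $$

Equation (4.5.20) reflects a general symmetry: Maxwell's equations for
propagating electric and magnetic fields in the absence of sources are
invariant under the transformation $\mathbf{E} \to \mathbf{B}$, $\mathbf{B} \to -\mathbf{E}$. Note the importance
of the minus sign. Without it, the Poynting vector would go in the wrong
direction!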
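
The role of the minus sign can be seen directly from the cross product: under $\mathbf{E} \to \mathbf{B}$, $\mathbf{B} \to -\mathbf{E}$ the product $\mathbf{E}\times\mathbf{B}$ is unchanged, whereas a plain swap reverses it. The sketch below is a minimal illustration; the helper names and the sample vectors are assumptions.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as sequences."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def dual(E, B):
    """Apply the transformation E -> B, B -> -E."""
    return list(B), [-x for x in E]


def dual_without_sign(E, B):
    """The naive swap E -> B, B -> E, shown only for contrast."""
    return list(B), list(E)


if __name__ == "__main__":
    E, B = [1, 0, 0], [0, 1, 0]          # a wave travelling along +z
    print(cross(E, B))                    # original Poynting direction
    print(cross(*dual(E, B)))             # unchanged by the duality rotation
    print(cross(*dual_without_sign(E, B)))  # reversed by the naive swap
```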
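Finally, we consider the electric quadrupole radiation (magnetic quadrupole
will come in the next order of $kr'$):

$$ (\mathbf{j}_k^{EQ})_\ell = -\omega\,k_j\,Q_{j\ell} \tag{4.5.21} $$

and

$$ E^{EQ}_\ell = -\frac{i\omega^2}{c^2r}\,e^{i(kr - \omega t)}\left(k_jQ_{j\ell} - k_jQ_{jm}\,\frac{x_mx_\ell}{r^2}\right). \tag{4.5.22} $$

The angular distribution of power radiated will be

$$ \frac{dW_{EQ}}{dt\,d\Omega} = \frac{\omega^6}{8\pi c^5}\left[\frac{x_iQ^*_{ij}\,Q_{jm}x_m}{r^2} - \frac{(x_iQ^*_{ij}x_j)\,(x_kQ_{km}x_m)}{r^4}\right]. \tag{4.5.23} $$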

Note that there is no general rule against interference of different 
radiation multipoles, although in some quantum transitions it may be 
forbidden. For example, in a J = 1 to J = 0 transition, ED and MD will 
not interfere, since ED requires a parity change and MD requires no 
parity change in the system. However, in J=2—J=1 with no parity 
change, MD and EQ can and generally will interfere. In contrast, in 
scattering problems at large kr’ there is usually interference between many 
multipoles. We shall return to this question when we discuss general 
multipole radiation in Chapter 5. 


4.6. FIELDS OF A POINT CHARGE MOVING 
AT CONSTANT HIGH VELOCITY v: 
EQUIVALENT PHOTONS 


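We write equations in the Lorentz gauge

$$ \left(\nabla^2 - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\mathbf{A} = -\frac{4\pi}{c}\,\mathbf{j} = -\frac{4\pi}{c}\,q\mathbf{v}\,\delta^3(\mathbf{r} - \mathbf{v}t) \tag{4.6.1} $$

and

$$ \left(\nabla^2 - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\phi = -4\pi\rho = -4\pi q\,\delta^3(\mathbf{r} - \mathbf{v}t). \tag{4.6.2} $$

We look for a solution $\mathbf{A}(\mathbf{r} - \mathbf{v}t)$, $\phi(\mathbf{r} - \mathbf{v}t)$. With $\mathbf{v}$ and $\mathbf{A}$ in the
$z$-direction, these satisfy the equations

$$ \left[\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \left(1 - \frac{v^2}{c^2}\right)\frac{\partial^2}{\partial z^2}\right]\begin{pmatrix} A_z \\ \phi \end{pmatrix} = -4\pi q\,\delta^3(\mathbf{r} - \mathbf{v}t)\begin{pmatrix} v/c \\ 1 \end{pmatrix}. \tag{4.6.3} $$

Change variables to

$$ z' = \frac{z - vt}{\sqrt{1 - v^2/c^2}}. $$

Then

$$ \left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z'^2}\right)\begin{pmatrix} A_z \\ \phi \end{pmatrix} = -4\pi q\begin{pmatrix} v/c \\ 1 \end{pmatrix}\delta^2(\boldsymbol{\rho})\,\delta\!\left(z'\sqrt{1 - \frac{v^2}{c^2}}\right) = -\frac{4\pi q}{\sqrt{1 - v^2/c^2}}\begin{pmatrix} v/c \\ 1 \end{pmatrix}\delta^3(\mathbf{r}'), \tag{4.6.4} $$

where $\boldsymbol{\rho} = \hat{\mathbf{e}}_x x + \hat{\mathbf{e}}_y y$ and $\mathbf{r}' = \boldsymbol{\rho} + \hat{\mathbf{e}}_z z'$, so that

$$ r' = \left(\rho^2 + \frac{(z - vt)^2}{1 - v^2/c^2}\right)^{1/2}. \tag{4.6.5} $$

The solutions of (4.6.4) are immediately given by the Coulomb potential:

$$ A_z = \frac{q}{\sqrt{1 - v^2/c^2}}\,\frac{v}{c}\,\frac{1}{r'} \tag{4.6.6} $$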
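and

$$ \phi = \frac{q}{\sqrt{1 - v^2/c^2}}\,\frac{1}{r'}. \tag{4.6.7} $$

The electric field is

$$ \mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} - \nabla\phi = \frac{q}{\sqrt{1 - v^2/c^2}}\left[-\frac{v^2}{c^2}\,\frac{(z - vt)\,\hat{\mathbf{e}}_z}{(1 - v^2/c^2)\,r'^3} + \frac{(z - vt)\,\hat{\mathbf{e}}_z}{(1 - v^2/c^2)\,r'^3} + \frac{\boldsymbol{\rho}}{r'^3}\right] $$

or

$$ \mathbf{E} = \frac{q}{\sqrt{1 - v^2/c^2}}\,\frac{\boldsymbol{\rho} + \hat{\mathbf{e}}_z(z - vt)}{r'^3}. \tag{4.6.8} $$

The magnetic field is

$$ \mathbf{B} = \nabla\times\mathbf{A} = \left(\hat{\mathbf{e}}_x\frac{\partial}{\partial x} + \hat{\mathbf{e}}_y\frac{\partial}{\partial y}\right)\times\hat{\mathbf{e}}_z A_z = \frac{\mathbf{v}}{c}\times\mathbf{E}. \tag{4.6.9} $$

Remarkably, the electric field at $\mathbf{r}$ points from the present position of the
charge ($\mathbf{v}t$) to the field point, as shown by (4.6.8). Note also the first hint
we have seen that $v > c$ would cause major problems.²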
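
The two properties just noted, that E points along the line from the present position of the charge and that B is purely transverse, can be checked by evaluating the closed forms directly. The sketch below assumes the field expressions $\mathbf{E} = q\,[\boldsymbol{\rho} + \hat{\mathbf{e}}_z(z - vt)]/(\sqrt{1 - v^2/c^2}\,r'^3)$ and $\mathbf{B} = (\mathbf{v}/c)\times\mathbf{E}$ in Gaussian units; the function name and the sample numbers are hypothetical.

```python
import math


def fields_of_uniform_charge(q, beta, rho_vec, z, t, c=1.0):
    """E and B of a charge moving along z with speed v = beta*c (|beta| < 1).

    rho_vec is the transverse position (x, y) of the field point.
    """
    v = beta * c
    s = math.sqrt(1.0 - beta ** 2)
    x, y = rho_vec
    dz = z - v * t
    r_prime = math.sqrt(x * x + y * y + dz * dz / (s * s))
    pref = q / (s * r_prime ** 3)
    E = [pref * x, pref * y, pref * dz]
    # B = (v/c) x E with v along z
    B = [-beta * E[1], beta * E[0], 0.0]
    return E, B


if __name__ == "__main__":
    E, B = fields_of_uniform_charge(1.0, 0.9, (1.0, 2.0), 3.0, 1.0)
    print(E, B)
```

For `beta = 0` the expression reduces to the static Coulomb field, and as `beta` approaches 1 the transverse components of E and B approach equal magnitude, which is the starting point for the equivalent-photon picture.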

²The fields E and B can also be calculated by Lorentz-transforming the Coulomb field
of a charged particle at rest. (See Problem 6.4.)

The fields E and B look remarkably like a light wave if the particle
velocity $v$ is very close to $c$. First, the field packet moves with a velocity
very close to $c$. Second, it is concentrated near $z = vt$, so that $E_z$ is small
compared with $E_\perp$; thus, E is almost transverse, B is exactly transverse,
orthogonal to E, and almost equal to E in magnitude. This circumstance
can be exploited to relate a process induced by fast charged particles to
the same process induced by low-frequency photons.

To do this, we calculate the radiant energy incident per unit area,
time, and frequency by Fourier-transforming the electric and magnetic
fields (4.6.8) and (4.6.9) as

$$ \mathbf{E}(\mathbf{r}, \omega) = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty}dt\,e^{i\omega t}\,\mathbf{E}(\mathbf{r}, t) \tag{4.6.10} $$

and

$$ \mathbf{B}(\mathbf{r}, \omega) = \frac{\mathbf{v}}{c}\times\mathbf{E}(\mathbf{r}, \omega). \tag{4.6.11} $$

Since E is effectively transverse and $v/c \approx 1$, the integrated Poynting vector
will be, with $\mathbf{E}(\mathbf{r}, \omega) = e^{i\omega z/c}\,\mathbf{E}(\boldsymbol{\rho}, \omega)$ and $\mathbf{B}(\mathbf{r}, \omega) = e^{i\omega z/c}\,\mathbf{B}(\boldsymbol{\rho}, \omega)$,

$$ \int_{-\infty}^{\infty}dt\,\mathbf{P}(\mathbf{r}, t) = \frac{c}{4\pi}\int_{-\infty}^{\infty}\mathbf{E}^*(\boldsymbol{\rho}, \omega)\times\mathbf{B}(\boldsymbol{\rho}, \omega)\,d\omega \tag{4.6.12} $$

$$ = \hat{\mathbf{e}}_z\,\frac{c}{4\pi}\int_{-\infty}^{\infty}|\mathbf{E}(\boldsymbol{\rho}, \omega)|^2\,d\omega \tag{4.6.13} $$


and the number of photons per unit frequency per charged particle
incident will be

$$ \frac{dN}{d\omega} = \frac{2c}{4\pi\hbar\omega}\int d^2\rho\,|\mathbf{E}(\boldsymbol{\rho}, \omega)|^2. \tag{4.6.14} $$

The factor of two comes from adding negative to positive frequencies.

How are we to interpret (4.6.14) in quantum theory? Since quantum
theory predicts probabilities, the number of photons in the range
$\omega_1 < \omega < \omega_2$, or $\int_{\omega_1}^{\omega_2}(dN/d\omega)\,d\omega$, must be interpreted, if small, as the
probability $p(\omega_2, \omega_1)$ of finding a photon in that range with a single
incident charge. That is, if $N_c$ is the (large) number of incident charged
particles, the number of photons emitted in the calculated frequency
interval will be

$$ N_\gamma = N_c\,p(\omega_2, \omega_1), $$

thus giving the same effective answer as (4.6.14).

If the calculated probability $p(\omega_2, \omega_1)$ turns out to be large, doubt is
cast on the calculation; the reaction of the target system on the charged
particle must be taken into account. This will be the case, for example, if
we consider very low frequencies, where the factor $1/\omega$ in (4.6.14)
becomes large.

We should comment here on the validity of using classical field theory
to calculate effects associated with the radiation of low-frequency quanta.
We learn in quantum electrodynamics that classical field theory is valid
when the quantum state contains many photons per volume $\lambda^3$ (with $\lambda$
the wavelength). This is certainly not the case for the problem we are
dealing with here. Quite the contrary, we consider the radiation of one
photon at a time. There is, however, another regime in which the classical
equations are applicable. That is a regime where we can limit ourselves
to a linear approximation in the field strengths. In that case, since the
quantum equations of motion are the same as the classical ones, classical
solutions hold as well for the quantum field operators. Thus, here and in
Section 4.8, where we discuss low-frequency bremsstrahlung, the
specifically quantum properties of electromagnetic fields may be ignored.

Of course, we cannot expect this classical calculation to hold for all
frequencies $\omega$ and radii $\rho$. The frequency must be small enough so that
the energy quantum $\hbar\omega$ is negligible compared to the energy of the incident
particle; the radius $\rho$ must be larger than the wavelength of the
particle, $\lambda = \hbar/p$, since otherwise one cannot give classical meaning to the
location $\rho$. With those caveats, we go ahead and calculate


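$$ \mathbf{E}(\mathbf{r}, \omega) = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty}dt\,e^{i\omega t}\,\mathbf{E}(\mathbf{r}, t) \tag{4.6.15} $$

$$ \approx \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty}dt\,e^{i\omega t}\,\frac{q}{\sqrt{1 - v^2/c^2}}\,\frac{\boldsymbol{\rho}}{\left[\rho^2 + \dfrac{(z - vt)^2}{1 - v^2/c^2}\right]^{3/2}}, \tag{4.6.16} $$

leading, with $v/c \approx 1$, to

$$ \mathbf{E}(\mathbf{r}, \omega) = e^{i\omega(z/c)}\,\frac{q\,\boldsymbol{\rho}}{\sqrt{2\pi}\,c}\int_{-\infty}^{\infty}dz'\,\frac{e^{i\omega\sqrt{1 - v^2/c^2}\,(z'/c)}}{(\rho^2 + z'^2)^{3/2}} \tag{4.6.17} $$

and with $z' = \rho u$,

$$ \mathbf{E}(\mathbf{r}, \omega) = e^{i\omega(z/c)}\,\frac{q}{\sqrt{2\pi}\,c}\,\frac{\boldsymbol{\rho}}{\rho^2}\int_{-\infty}^{\infty}du\,\frac{e^{i\omega(\rho/c)\sqrt{1 - v^2/c^2}\,u}}{(1 + u^2)^{3/2}}. \tag{4.6.18} $$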


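The factor $\exp\{i\omega(\rho/c)\sqrt{1 - v^2/c^2}\,u\}$ produces a classical cut-off for $\rho$
in (4.6.14): For $\rho(\omega/c)\sqrt{1 - v^2/c^2} > 1$, the oscillating exponential will
decrease the $u$ integral from its value at $\omega = 0$. This comes about because
at a transverse location $\rho$, the electromagnetic pulse has a characteristic
time of passage

$$ \Delta t \sim \frac{\rho}{c}\sqrt{1 - \frac{v^2}{c^2}}, $$

so that the characteristic classical frequency $\omega_{\rm cl}$ in the Fourier transform
will be $\omega_{\rm cl} \sim c/(\rho\sqrt{1 - v^2/c^2})$.

For $\omega < \omega_{\rm cl}$, the approximation $\omega \approx 0$ can be made. For $\omega > \omega_{\rm cl}$, or
$\rho > c/(\omega\sqrt{1 - v^2/c^2})$, the Fourier transform will fall off, as shown explicitly
in (4.6.18). We thus have, for

$$ \rho < \rho_{\max} = \frac{c}{\omega\sqrt{1 - v^2/c^2}}, $$

$$ \mathbf{E}(z, \boldsymbol{\rho}, \omega) = e^{i\omega(z/c)}\,\frac{q}{\sqrt{2\pi}\,c}\,\frac{\boldsymbol{\rho}}{\rho^2}\int_{-\infty}^{\infty}\frac{du}{(1 + u^2)^{3/2}} = e^{i\omega(z/c)}\,\frac{2q}{\sqrt{2\pi}\,c}\,\frac{\boldsymbol{\rho}}{\rho^2}. \tag{4.6.19} $$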

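The equivalent photon spectrum is given by substituting (4.6.19) in 
(4.6.14): 

$$\frac{dN}{d\omega} = \frac{2q^2}{\pi\hbar\omega c} \int_{\rho_{\min}}^{\rho_{\max}} \frac{d\rho}{\rho}. \tag{4.6.20}$$

As discussed earlier, 

$$\rho_{\max} = \frac{c}{\omega\sqrt{1 - v^2/c^2}} \quad\text{and}\quad \rho_{\min} = \frac{\hbar}{p},$$

with p the incident particle momentum. The final result is 

$$\frac{dN}{d\omega} = \frac{2}{\pi}\,\frac{q^2}{\hbar c}\,\frac{1}{\omega}\,\log\frac{cp}{\hbar\omega\sqrt{1 - v^2/c^2}}. \tag{4.6.21}$$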


Note that since v is close to c and ħω ≪ cp (the particle energy), the 
argument of the logarithm is very large, and therefore the log will be 
insensitive to the precise value of these cut-offs. Thus, (4.6.21) makes 
quantitative, not just qualitative, sense, since in addition the factor q²/ħc 
(approximately 1/137 for electrons) allows the probability ∫ dω (dN/dω) 
to be small. 
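
As a rough numerical illustration of (4.6.21), the following Python sketch evaluates ω dN/dω for a fast electron. The function name, the choice of a 1 GeV electron, and the photon energies are assumptions made here for illustration only; the formula itself is (4.6.21) with the logarithm's argument written as γ·pc/ħω, where γ = 1/√(1 − v²/c²).

```python
import math

ALPHA = 1.0 / 137.036  # q^2/(hbar c) for a particle of charge e


def omega_dn_domega(hbar_omega, pc, gamma, alpha=ALPHA):
    """Return omega * dN/domega from (4.6.21).

    The argument of the logarithm is rho_max/rho_min, i.e.
    c p / (hbar omega sqrt(1 - v^2/c^2)) = gamma * pc / hbar_omega.
    hbar_omega and pc must be given in the same energy units.
    """
    ratio = gamma * pc / hbar_omega
    if ratio <= 1.0:
        raise ValueError("need rho_max > rho_min for the classical estimate")
    return (2.0 * alpha / math.pi) * math.log(ratio)


# Illustrative (assumed) numbers: a 1 GeV electron, photons of 1 and 10 MeV.
M_E = 0.511                      # electron rest energy in MeV
ENERGY = 1000.0                  # total electron energy in MeV
GAMMA = ENERGY / M_E
PC = math.sqrt(ENERGY**2 - M_E**2)

for hw in (1.0, 10.0):
    print(hw, omega_dn_domega(hw, PC, GAMMA))
```

Changing ħω by a factor of ten shifts the result only by (2α/π) log 10, which is the insensitivity to the cut-offs noted above.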

Equation (4.6.21) gives directly the relation of a fast charged particle 
induced cross section dσ_q, with energy loss ħω, to the photon induced 
cross section dσ_γ at frequency ω. If the particle is an electron, it is, for a 
range Δω of frequency, 

$$d\sigma_e = \frac{2}{\pi}\,\frac{e^2}{\hbar c}\,\frac{\Delta\omega}{\omega}\,\log\frac{cp}{\hbar\omega\sqrt{1 - v^2/c^2}}\;d\sigma_\gamma(\omega). \tag{4.6.22}$$


This relation was discovered in the early days of quantum theory by C. 
F. Weizsäcker and E. J. Williams. 


4.7. A POINT CHARGE MOVING WITH 
ARBITRARY VELOCITY LESS THAN c: 
THE LIÉNARD-WIECHERT POTENTIALS 


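We return to the general form (4.2.17) and (4.2.18). This gives 

$$\mathbf{A}(\mathbf{r}, t) = \frac{1}{c}\int d\mathbf{r}'\,dt'\,\frac{1}{|\mathbf{r} - \mathbf{r}'|}\,\delta\!\left(t' - t + \frac{|\mathbf{r} - \mathbf{r}'|}{c}\right)\mathbf{j}(\mathbf{r}', t') \tag{4.7.1}$$

and 

$$\phi(\mathbf{r}, t) = \int d\mathbf{r}'\,dt'\,\frac{1}{|\mathbf{r} - \mathbf{r}'|}\,\delta\!\left(t' - t + \frac{|\mathbf{r} - \mathbf{r}'|}{c}\right)\rho(\mathbf{r}', t'). \tag{4.7.2}$$

The current and charge density are those of a point particle: 

$$\mathbf{j}(\mathbf{r}', t') = q\,\mathbf{v}(t')\,\delta^3(\mathbf{r}' - \mathbf{r}(t')) \tag{4.7.3}$$

and 

$$\rho(\mathbf{r}', t') = q\,\delta^3(\mathbf{r}' - \mathbf{r}(t')) \tag{4.7.4}$$

where r(t') and v(t') are the coordinate and velocity vectors of the particle 
at time t'. We carry out the dr' integral first. There results 

$$\mathbf{A}(\mathbf{r}, t) = \frac{q}{c}\int \frac{dt'}{|\mathbf{r} - \mathbf{r}(t')|}\,\delta\!\left(t' - t + \frac{|\mathbf{r} - \mathbf{r}(t')|}{c}\right)\mathbf{v}(t') \tag{4.7.5}$$

and 

$$\phi(\mathbf{r}, t) = q\int \frac{dt'}{|\mathbf{r} - \mathbf{r}(t')|}\,\delta\!\left(t' - t + \frac{|\mathbf{r} - \mathbf{r}(t')|}{c}\right). \tag{4.7.6}$$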


To carry out the dt' integration, we are first required, given a field 
point r and a time t, to find the retarded time t' = t_R, such that 

$$t_R = t - \frac{|\mathbf{r} - \mathbf{r}(t_R)|}{c}. \tag{4.7.7}$$

Equation (4.7.7) has only one solution, provided the particle velocity is 
less than c. To see this, imagine again a spherical light wave aimed to 
converge on the point r at time t. It will cross every charged particle at 
some time t_R and only cross each particle once, since it is moving with 
velocity c > v. Clearly, the time the spherical wave crosses the particle 
trajectory is the retarded time for that particle. In general, one cannot 
solve for t_R analytically, but the argument just given shows that a numerical 
calculation [given r(t), of course] can succeed. In the special case of 
uniform motion, (4.7.7) leads to a quadratic equation for t_R, which can 
be solved algebraically. In fact, the procedure we are about to follow here 
could be used as an alternative way of finding the fields of a particle 
moving with constant velocity. The retarded and advanced fields of 
a uniformly moving charge are equal to each other and to the convective 
fields described in the last section. 
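
The numerical calculation mentioned above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the bisection strategy, and the test trajectory are choices made here, not the author's. The sketch uses the fact that t' − t + |r − r(t')|/c increases monotonically with t' when |v| < c, so its single zero can be bracketed and bisected; the scalar potential is then q/s with s = |r − r(t_R)| − (r − r(t_R))·v(t_R)/c, the form quoted in (4.7.11) and (4.7.12).

```python
import math


def retarded_time(field_point, t, trajectory, c=1.0, tol=1e-13):
    """Solve t_R = t - |r - r(t_R)|/c, eq. (4.7.7), by bisection.

    tau(t') = t' - t + |r - r(t')|/c increases monotonically with t'
    for |v| < c, so it has exactly one zero.
    """
    def tau(tp):
        return tp - t + math.dist(field_point, trajectory(tp)) / c

    hi = t                       # tau(t) = |r - r(t)|/c >= 0
    step = 1.0
    lo = t - step
    while tau(lo) > 0.0:         # walk back until tau changes sign
        step *= 2.0
        lo = t - step
    while hi - lo > tol * max(1.0, abs(lo), abs(hi)):
        mid = 0.5 * (lo + hi)
        if tau(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


def scalar_potential(field_point, t, trajectory, velocity, q=1.0, c=1.0):
    """phi = q/s with s = |r - r(t_R)| - (r - r(t_R)).v(t_R)/c."""
    t_r = retarded_time(field_point, t, trajectory, c)
    sep = [x - y for x, y in zip(field_point, trajectory(t_r))]
    v = velocity(t_r)
    s = math.hypot(*sep) - sum(a * b for a, b in zip(sep, v)) / c
    return q / s


# Assumed test case: uniform motion along z with v = 0.5 c,
# observed at the point (1, 0, 0) at time t = 0 (units with c = 1).
V = 0.5
path = lambda tp: (0.0, 0.0, V * tp)
vel = lambda tp: (0.0, 0.0, V)
t_r = retarded_time((1.0, 0.0, 0.0), 0.0, path)
phi = scalar_potential((1.0, 0.0, 0.0), 0.0, path, vel)
print(t_r, phi)
```

For this uniform-motion case the quadratic equation gives t_R = −1/√(1 − v²) in closed form, which provides a check on the bisection.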

Assuming we have found the solution of (4.7.7) for t_R, we must do 
the integrals in (4.7.5) and (4.7.6). To do that, we change variables to the 
argument of the δ function, that is, 

$$\tau = t' - t + \frac{|\mathbf{r} - \mathbf{r}(t')|}{c}. \tag{4.7.8}$$

Then 

$$d\tau = dt'\left(1 - \frac{\mathbf{r} - \mathbf{r}(t')}{c\,|\mathbf{r} - \mathbf{r}(t')|}\cdot\frac{d\mathbf{r}(t')}{dt'}\right)$$

or 

$$\frac{dt'}{d\tau} = \frac{1}{1 - \hat{\mathbf{r}}\cdot\mathbf{v}/c} \tag{4.7.9}$$

where r̂ is the unit vector pointing from the retarded position³ of the 
particle r(t_R) to the field point r, and v is the velocity dr(t')/dt' at 
t' = t_R. 

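The integrals (4.7.5) and (4.7.6) can now be done using the τ variable: 

$$\mathbf{A}(\mathbf{r}, t) = \frac{q}{c}\int d\tau\,\frac{dt'}{d\tau}\,\delta(\tau)\,\frac{\mathbf{v}(t')}{|\mathbf{r} - \mathbf{r}(t')|} = \frac{q}{c}\,\frac{\mathbf{v}(t_R)}{|\mathbf{r} - \mathbf{r}(t_R)|\,(1 - \hat{\mathbf{r}}\cdot\mathbf{v}/c)}$$

or 

$$\mathbf{A} = \frac{q\mathbf{v}}{cs} \tag{4.7.10}$$

and 

$$\phi = \frac{q}{s} \tag{4.7.11}$$

where 

$$s = |\mathbf{r} - \mathbf{r}(t_R)| - \frac{(\mathbf{r} - \mathbf{r}(t_R))\cdot\mathbf{v}(t_R)}{c}. \tag{4.7.12}$$

³A point to keep in mind for possible future reference is that the equivalent denominator 
for the advanced solution is 1 + r̂·v/c, where r̂ and v are now calculated at the 
advanced time: t_A = t + |r − r(t_A)|/c. 
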


4.8. LOW-FREQUENCY BREMSSTRAHLUNG⁴ 


Before taking up the E and B fields, we consider the radiation of 
low-frequency photons in the course of a scattering event. As in our discussion 
of equivalent photons in Section 4.6, we must confine our calculations to 
low enough frequencies so that the quantum corrections will not be 
significant. That means that ħω must as a matter of principle be small compared 
to characteristic energies of the radiationless scattering process; for 
example, we must have 

$$\hbar\omega \ll W \tag{4.8.1}$$

where W is the incident energy of the charged projectile. 

As a matter of practice, we will consider ω also smaller than the 
characteristic classical frequencies of the motion, for example, the classical 
frequency ω_cl ~ v/b, where v is the incident particle velocity and b the 
impact parameter (assuming that b is within the range of the force). This 
is because a calculation of the frequency dependence of the process must 
be specific to the system being considered; we are interested here in 
general results, including the case of nonclassical particle motion. 

What we will do therefore is to calculate the zero frequency limit of 
radiation by a system that we imagine to be correctly described, either 
by classical or by quantum equations, whichever is called for. 

We imagine a scattering event (see Figure 4.1) in which the 
observation of electromagnetic radiation is made at r, between times t_1 
and t_2, where t_1 and t_2 are such that t_1R is before the particle has entered 
the force field of the scatterer and t_2R is after the particle has left the force 
field of the scatterer. (This can always be done: Choose t_1R, r_1 and t_2R, 
r_2 first; then find t_1 and t_2 by clocking rays from r_1 to r and from r_2 to r.) 

We calculate the electric field as the transverse part of −(1/c) ∂A/∂t. 


⁴Bremsstrahlung is German for braking radiation. 


Figure 4.1. (Labels: r_1, t_1R and r_2, t_2R on either side of the interaction region.) 


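Then 

$$\mathbf{E} = -\frac{1}{c}\,\frac{\partial \mathbf{A}_T}{\partial t} \tag{4.8.2}$$

and the Fourier transform of E is 

$$\mathbf{E}(\mathbf{r}, \omega) = -\frac{1}{\sqrt{2\pi}}\int_{t_1}^{t_2} dt\,e^{i\omega t}\,\frac{1}{c}\,\frac{\partial \mathbf{A}_T(\mathbf{r}, t)}{\partial t}. \tag{4.8.3}$$

We do not have to integrate before t_1 or after t_2 since there will be no 
radiation field (i.e., no E ~ 1/r) at those times. 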


Given E(r, ω), we know how to calculate the energy radiated per unit 
area. It is 

$$\frac{dW}{dS} = \int \mathbf{P}\cdot\hat{\mathbf{r}}\;d\omega \tag{4.8.4}$$

where P is the Poynting vector, 

$$\mathbf{P}(\omega) = \frac{c}{4\pi}\,\mathbf{E}^*(\mathbf{r}, \omega)\times\mathbf{B}(\mathbf{r}, \omega) \tag{4.8.5}$$

$$= \frac{c}{4\pi}\,|\mathbf{E}(\mathbf{r}, \omega)|^2\,\hat{\mathbf{r}}, \tag{4.8.6}$$

and where r̂ is the unit vector pointing from the target. Remember that r 
is asymptotic, that is, the unit vector 

$$\hat{\mathbf{r}} = \frac{\mathbf{r} - \mathbf{r}(t_R)}{|\mathbf{r} - \mathbf{r}(t_R)|}$$

is independent of t_R during the scattering. 

We consider only ω near zero, as discussed earlier. More precisely, 
we calculate the energy spectrum at ω = 0. It is 


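$$\frac{dW}{dS\,d\omega}\bigg|_{\omega = 0} = 2\,\mathbf{P}(0)\cdot\hat{\mathbf{r}} \tag{4.8.7}$$

where the factor 2 takes into account both signs of the frequency. We 
observe that E(r, ω = 0) can be calculated from (4.8.3). It is 

$$\mathbf{E}(\mathbf{r}, \omega = 0) = -\frac{1}{\sqrt{2\pi}\,c}\left[\mathbf{A}_T(\mathbf{r}, t_2) - \mathbf{A}_T(\mathbf{r}, t_1)\right] \tag{4.8.8}$$

and from (4.7.10) and (4.7.12), 

$$\mathbf{A}(\mathbf{r}, t_2) = \frac{q\mathbf{v}_2}{cr\left(1 - \hat{\mathbf{r}}\cdot\mathbf{v}_2/c\right)} \tag{4.8.9}$$

and 

$$\mathbf{A}(\mathbf{r}, t_1) = \frac{q\mathbf{v}_1}{cr\left(1 - \hat{\mathbf{r}}\cdot\mathbf{v}_1/c\right)} \tag{4.8.10}$$

where v_1 and v_2 are the velocities of the charged particle before and after 
the scattering. Thus, 

$$\mathbf{E}(\mathbf{r}, \omega = 0) = -\frac{q}{rc^2}\,\frac{1}{\sqrt{2\pi}}\left[\frac{\mathbf{v}_2}{1 - \hat{\mathbf{r}}\cdot\mathbf{v}_2/c} - \frac{\mathbf{v}_1}{1 - \hat{\mathbf{r}}\cdot\mathbf{v}_1/c}\right]_T. \tag{4.8.11}$$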


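The total energy radiated per unit solid angle is 

$$\frac{dW}{d\Omega} = \frac{c}{2\pi}\,r^2\int_0^{\infty} d\omega\;\mathbf{E}^*(\mathbf{r}, \omega)\cdot\mathbf{E}(\mathbf{r}, \omega) \tag{4.8.12}$$

and per unit solid angle and frequency is, for ω near zero, 

$$\frac{d^2W}{d\Omega\,d\omega} = \frac{q^2}{(2\pi)^2 c}\left[\frac{\mathbf{v}_2/c}{1 - \hat{\mathbf{r}}\cdot\mathbf{v}_2/c} - \frac{\mathbf{v}_1/c}{1 - \hat{\mathbf{r}}\cdot\mathbf{v}_1/c}\right]_T^2.$$

The number of photons in a frequency range⁵ dω is thus 

$$\frac{dN}{d\Omega\,d\omega} = \frac{d^2W}{d\Omega\,d\omega\,\hbar\omega} = \frac{q^2}{(2\pi)^2\hbar c\,\omega}\left[\frac{\mathbf{v}_2/c}{1 - \hat{\mathbf{r}}\cdot\mathbf{v}_2/c} - \frac{\mathbf{v}_1/c}{1 - \hat{\mathbf{r}}\cdot\mathbf{v}_1/c}\right]_T^2. \tag{4.8.13}$$

The meaning of (4.8.13) is the following: If the charged particle comes in 
with velocity v_1 and is scattered with velocity v_2, the number of photons 
it will radiate in the frequency range Δω and solid angle ΔΩ is 

$$\Delta N = \int_{\Delta\omega} d\omega \int_{\Delta\Omega} d\Omega\;\frac{dN}{d\Omega\,d\omega}. \tag{4.8.14}$$
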


The probability of that charged particle event is the cross section for 
the event times the incident number of particles per unit area. Therefore, 
the cross section for producing a photon in the frequency range Δω and 
angular interval ΔΩ is given by (4.8.13) in terms of the radiationless cross 
section dσ/dτ, where dτ describes the final particle state; for example, in 
a scattering, dτ might be dΩ of the final particle. The relation is 

$$\frac{d\sigma_\gamma}{d\Omega\,d\omega\,d\tau} = \frac{1}{(2\pi)^2}\,\frac{q^2}{\hbar c\,\omega}\left[\frac{\mathbf{v}_2/c}{1 - \hat{\mathbf{r}}\cdot\mathbf{v}_2/c} - \frac{\mathbf{v}_1/c}{1 - \hat{\mathbf{r}}\cdot\mathbf{v}_1/c}\right]_T^2 \frac{d\sigma}{d\tau}. \tag{4.8.15}$$

⁵Remember, as discussed in Section 4.6, that the quantum mechanical translation of 
“number of photons in a range” is the “probability of radiating a photon in a range.” 

Equation (4.8.15) can be generalized to several incoming and outgoing 
charged particles. The squared bracket times q² is simply replaced by 
the corresponding sum over the charged particles, giving 

$$\frac{d\sigma_\gamma}{d\Omega\,d\omega\,d\tau} = \frac{1}{(2\pi)^2\hbar c\,\omega}\left[\sum_f \frac{q_f\,\mathbf{v}_f/c}{1 - \hat{\mathbf{r}}\cdot\mathbf{v}_f/c} - \sum_i \frac{q_i\,\mathbf{v}_i/c}{1 - \hat{\mathbf{r}}\cdot\mathbf{v}_i/c}\right]_T^2 \frac{d\sigma}{d\tau} \tag{4.8.16}$$

where f denotes final, i initial. The transverse component squared of a 
vector j is, of course, 


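$$(\mathbf{j}_T)^2 = (\mathbf{j} - \hat{\mathbf{r}}\,\hat{\mathbf{r}}\cdot\mathbf{j})^2 = \mathbf{j}^2 - (\hat{\mathbf{r}}\cdot\mathbf{j})^2. \tag{4.8.17}$$

We see from (4.8.15) that the radiation of fast particles peaks strongly 
near the direction of v_1 or v_2. Thus, although, with θ the angle between 
r̂ and v_1, 

$$\left[\frac{\hat{\mathbf{v}}_1 - \hat{\mathbf{r}}\,(\hat{\mathbf{r}}\cdot\hat{\mathbf{v}}_1)}{1 - \hat{\mathbf{r}}\cdot\mathbf{v}_1/c}\right]^2 = \frac{\sin^2\theta}{\left(1 - \dfrac{v_1}{c}\cos\theta\right)^2} \tag{4.8.18}$$

vanishes at θ = 0, it peaks strongly at cos θ = v_1/c, where it has the value 

$$\frac{1}{1 - v_1^2/c^2}. \tag{4.8.19}$$

The low-frequency radiation thus comes out mainly in two sprays near 
v_1 and v_2. Integrated over solid angle, each of these sprays gives 

$$\int d\Omega\,\frac{\sin^2\theta}{\left(1 - \dfrac{v}{c}\cos\theta\right)^2} = 2\pi\int_{-1}^{1}\frac{(1 - x^2)\,dx}{\left(1 - \dfrac{v}{c}x\right)^2} \approx 4\pi\log\frac{1}{1 - v/c} \tag{4.8.20}$$

for v/c close to 1. 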
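
The peak and the solid-angle integral of the spray factor sin²θ/(1 − (v/c) cos θ)² can be checked numerically. The Python sketch below is an illustration only: the function names are invented here, and the closed form (4π/β³)[log((1 + β)/(1 − β)) − 2β] with β = v/c is an elementary evaluation of the x integral added for comparison, not a formula quoted from the text. Its logarithm is what dominates as β approaches 1.

```python
import math


def spray(x, beta):
    """sin^2(theta) / (1 - beta cos(theta))^2 written in terms of x = cos(theta)."""
    return (1.0 - x * x) / (1.0 - beta * x) ** 2


def solid_angle_integral(beta, n=200000):
    """2*pi * integral of spray over x in [-1, 1], by the midpoint rule."""
    h = 2.0 / n
    total = sum(spray(-1.0 + (i + 0.5) * h, beta) for i in range(n))
    return 2.0 * math.pi * h * total


def closed_form(beta):
    """Elementary closed form of the same integral (derived for this sketch)."""
    return (4.0 * math.pi / beta**3) * (
        math.log((1.0 + beta) / (1.0 - beta)) - 2.0 * beta
    )


beta = 0.99
print(spray(beta, beta), 1.0 / (1.0 - beta**2))        # peak value at cos(theta) = beta
print(solid_angle_integral(beta), closed_form(beta))   # numerical vs closed form
```

For small β the closed form tends to 8π/3, the familiar integral of sin²θ over the sphere, which is a useful sanity check on the algebra.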

An important consequence of the finiteness of E(r, ω = 0) is the 
divergence of the cross section for photon emission at low frequencies, since 
∫ dω/ω diverges at ω = 0. The high ω divergence in the integral is a 
consequence of our low-frequency approximation. It turns out in quantum 
theory that the meaning of the low-frequency divergence is that no charged 
particles can interact without radiating, perhaps no big surprise. That 
means that one cannot define an “elastic” amplitude which includes 
charged particles in the initial or final state, since there is always 
accompanying radiation. One can however define and measure a cross 
section for a charged particle to scatter with a finite energy resolution for 
the scattered particle. The cross section will be a function of the incident 
energy, the scattered angle, and the resolution ΔE. Because of the zero 
rest mass of the photon, no matter how small ΔE, any number of 
low-energy photons could be produced in the process. The low-frequency 
divergence of the bremsstrahlung cross section is a signal that as ΔE → 0, 
the ‘elastic’ cross section goes to zero. The meaning of ‘elastic’ is charged 
particle energy loss less than ΔE. 

The mathematical working out of the problem⁶ makes use of the fact 
that the total cross section for a finite resolution, including all radiation, 
is finite. Thus, 

$$\sigma_{\text{elastic}} + \int_0^{\Delta E} d\omega\,\frac{d\sigma_\gamma}{d\omega}$$

should be finite. But dσ_γ/dω, as we have seen, goes like 1/ω, so 
∫₀ (dσ_γ/dω) dω diverges at ω = 0. Therefore, σ_elastic must have a canceling 
divergence: 

$$\sigma_{\text{elastic}} = \sigma_0\left(1 - \int_0 f(\omega)\,d\omega\right),$$

where σ_0 is the lowest-order calculation (in q²/ħc) and σ_0 f(ω) → dσ_γ/dω 
as ω → 0. This has the embarrassing problem of producing a negative 
elastic cross section. The remedy is found in quantum electrodynamics, 
where it is shown that an exact calculation would replace 

$$1 - \int_0 f(\omega)\,d\omega \quad\text{by}\quad \exp\left(-\int_0 f(\omega)\,d\omega\right) = 0,$$

since the integral is positive and divergent. This is the way the elastic cross 
section is made to vanish. The measured cross section with a resolution ΔE 
will be, in lowest order, 

$$\sigma_{\text{measured}} = \sigma_{\text{elastic}} + \int_0^{\Delta E} d\omega\,\frac{d\sigma_\gamma}{d\omega}$$

or 

$$\sigma_{\text{measured}} = \sigma_0\left(1 - \int_0 f(\omega)\,d\omega\right) + \sigma_0\int_0^{\Delta E} f(\omega)\,d\omega = \sigma_0\left(1 - \int_{\Delta E} f(\omega)\,d\omega\right),$$

which depends logarithmically on ΔE, but is always finite. The exact 
formula for small ΔE will be 

$$\sigma_0\exp\left(-\int_{\Delta E} f(\omega)\,d\omega\right).$$

The appearance in (4.8.15) of q²/ħc ~ 1/137 keeps the correction from 
being large except for very small ΔE’s. 

⁶This paragraph is impressionistic and must be read with that in mind. The formulas 
given are not mathematics. However, the final result is correct and important for experiments 
with charged particles. 


4.9. LIÉNARD-WIECHERT FIELDS 


We now use (4.7.10-4.7.12) for A and φ to calculate the E and B fields 
of a charged particle moving with arbitrary velocity v (|v| < 1).⁷ To do 
so, we must be able to calculate space and time derivatives of t_R. First, 
∂t_R/∂t. Since t_R = t − |x − y(t_R)|, 

$$\frac{\partial t_R}{\partial t} = 1 + \hat{\mathbf{r}}\cdot\mathbf{v}\,\frac{\partial t_R}{\partial t}$$

so that 

$$\frac{\partial t_R}{\partial t} = \frac{1}{1 - \hat{\mathbf{r}}\cdot\mathbf{v}} = \frac{r}{s} \tag{4.9.1}$$

where all symbols stand for the retarded values. Next, ∇t_R = −r̂ + r̂·v ∇t_R, 
so 

$$\nabla t_R = -\frac{\hat{\mathbf{r}}}{1 - \hat{\mathbf{r}}\cdot\mathbf{v}}. \tag{4.9.2}$$

⁷From now on we use x to designate the field point, y the particle coordinate, and r 
to designate x − y(t_R); thus, |r| = |x − y(t_R)|, r̂ = (x − y(t_R))/r, etc. The velocity and 
acceleration of the radiating particle are always taken at the retarded time. In addition, we choose 
units in which c = 1. This saves a lot of writing and prevents a lot of trivial errors. The final 
answer to any problem can always be expressed in conventional units by dimensional analysis. 

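We can now proceed to E and B. From (4.7.10) and (4.7.11), 

$$\nabla\phi = -\frac{q}{s^2}\,\nabla s = -\frac{q}{s^2}\left[\hat{\mathbf{r}} - \mathbf{v} + \frac{\partial s}{\partial t_R}\,\nabla t_R\right] \tag{4.9.3}$$

and 

$$\frac{\partial \mathbf{A}}{\partial t} = q\left(\frac{\mathbf{a}}{s} - \frac{\mathbf{v}}{s^2}\,\frac{\partial s}{\partial t_R}\right)\frac{\partial t_R}{\partial t} \tag{4.9.4}$$

where a is dv(t_R)/dt_R. Thus, 

$$\mathbf{E} = -\nabla\phi - \frac{\partial \mathbf{A}}{\partial t} = q\left(\frac{\hat{\mathbf{r}} - \mathbf{v}}{s^2} - \frac{\mathbf{r}}{s^3}\,\frac{\partial s}{\partial t_R} - \frac{r\mathbf{a}}{s^2} + \frac{r\mathbf{v}}{s^3}\,\frac{\partial s}{\partial t_R}\right) \tag{4.9.5}$$

which, with 

$$\frac{\partial s}{\partial t_R} = -\hat{\mathbf{r}}\cdot\mathbf{v} + v^2 - \mathbf{r}\cdot\mathbf{a} \tag{4.9.6}$$

gives 

$$\frac{s^3\mathbf{E}}{q} = (\mathbf{r} - r\mathbf{v})(1 - v^2) + \mathbf{r}\times[(\mathbf{r} - r\mathbf{v})\times\mathbf{a}]. \tag{4.9.7}$$

Notice that for large r the first contribution goes like 1/r²; the second goes 
like 1/r and is transverse. 

Turning to the magnetic field, we obtain 

$$\frac{\mathbf{B}}{q} = \nabla\times\frac{\mathbf{v}}{s} \tag{4.9.8}$$

$$= \frac{\nabla t_R\times\mathbf{a}}{s} - \frac{\nabla s}{s^2}\times\mathbf{v}; \tag{4.9.9}$$

with 

$$\nabla t_R = -\hat{\mathbf{r}}\,\frac{\partial t_R}{\partial t} = -\frac{\mathbf{r}}{s}$$

and 

$$\nabla s = \hat{\mathbf{r}} - \mathbf{v} + \frac{\partial s}{\partial t_R}\,\nabla t_R$$

we find, after some algebra, 

$$\mathbf{B} = \hat{\mathbf{r}}\times\mathbf{E}. \tag{4.9.10}$$

An aid to memory in (4.9.7) is to define a “virtual present radius,” 
r_v = r − vr, that appears twice in (4.9.7). Thus, 

$$\frac{s^3\mathbf{E}}{q} = \mathbf{r}_v(1 - v^2) + \mathbf{r}\times(\mathbf{r}_v\times\mathbf{a}) \tag{4.9.11}$$

and B is still r̂ × E. 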

We call r_v the virtual present radius because it is the value r would 
have at time t if the radiating particle kept on the course it was following 
at time t_R for the time t − t_R = r. This should be clear from Figure 4.2. 

Figure 4.2. (Label: v(t − t_R) = vr.) 

The radiation fields are thus given by 

$$\mathbf{E}_{\text{rad}} = \frac{q}{s^3}\,\mathbf{r}\times[\mathbf{r}_v\times\mathbf{a}] \tag{4.9.12}$$

and 

$$\mathbf{B}_{\text{rad}} = \hat{\mathbf{r}}\times\mathbf{E}_{\text{rad}}. \tag{4.9.13}$$

Figure 4.3. 

Of course, E_rad and B_rad are transverse to r̂ and orthogonal to each other. 

For small velocities, (4.9.12) and (4.9.13) reduce to our previous 
results for electric dipole radiation, (4.4.6-4.4.8). 

For v ~ 1, one sees that the factor 1/(1 − v·r̂)³ peaks the radiation 
sharply in the direction of v, even though the amplitude vanishes quite 
close to θ = 0. It follows from (4.9.12) that E_rad and B_rad vanish whenever a 
and r_v are parallel. That this always happens for two values of r̂ can be 
seen geometrically, as shown in Figure 4.3. 

The two segments of the dotted line give two positions of r̂ such that 
r_v is parallel to a and, hence, for which E and B vanish. There is no other 
direction in which E vanishes. Note that as v → 1, the intersections P_1, P_2 
and the vectors r̂_1 and r̂_2 move to the direction of v. 

The fact that E vanishes near θ = 0 does not prevent the radiation 
from peaking forward (as we have already shown in our discussion of 
bremsstrahlung in Section 4.8). We will discuss this for the simple case of 
a parallel to v, so that 

$$\mathbf{E} = \frac{q}{s^3}\,\mathbf{r}\times(\mathbf{r}\times\mathbf{a}), \tag{4.9.14}$$

very much like the low v electric dipole radiation, but with the factor 
(1 − v cos θ)³ in the denominator. 
The intensity of radiation crossing the distant sphere is 

$$\mathbf{P}\cdot\hat{\mathbf{r}} = \frac{q^2}{4\pi}\,\frac{a^2 r^4}{s^6}\,\sin^2\theta \tag{4.9.15}$$

which vanishes exactly for θ = 0 and π, but whose angular dependence 
for small θ and v ~ 1 is 

$$\frac{dW}{dt\,d\Omega} \sim \frac{\theta^2}{\left(1 - v + \dfrac{\theta^2}{2}\right)^6} \tag{4.9.16}$$

producing a sharp maximum at θ² = 2(1 − v)/5. 
The calculation of the total radiated energy is elementary, but 
complicated. The case of a parallel to v (4.9.15), however, is quite simple. 
We choose to calculate the rate of radiation by the particle, that is, 
dW/dt_R dΩ, rather than the rate of radiation through the distant sphere, 
dW/dt dΩ. These two rates are different, since ∂t_R/∂t = r/s. Of course, 
integrated over time, they are equal, since 

$$\int \frac{dW}{dt}\,dt = \int \frac{dW}{dt_R}\,dt_R. \tag{4.9.17}$$

However, the rate of energy loss by the particle in its trajectory is generally 
the more interesting question. We calculate, for a parallel to v, 

$$\int d\Omega\,\frac{dW_\parallel}{dt_R\,d\Omega} = \int d\Omega\,\frac{dW_\parallel}{dt\,d\Omega}\,\frac{\partial t}{\partial t_R} = \frac{q^2 a^2}{4\pi}\int \frac{d\Omega\,\sin^2\theta}{(1 - v\cos\theta)^5} \tag{4.9.18}$$

$$= \frac{2}{3}\,\frac{q^2 a_\parallel^2}{(1 - v^2)^3}. \tag{4.9.19}$$

It is easy to see from (4.9.12) that the parallel and perpendicular 
components of a do not interfere in the total energy radiation rate after 
integration over the azimuthal angle φ. The total radiation rate from the 
perpendicular component of a is not so simple an integral as (4.9.18). We 
give the result: 

$$\int d\Omega\,\frac{dW_\perp}{dt_R\,d\Omega} = \frac{2}{3}\,\frac{q^2 a_\perp^2}{(1 - v^2)^2}. \tag{4.9.20}$$


We shall see later (see Problem 6.3) that (4.9.19) and (4.9.20) are 
simple consequences of the relativistic transformation properties of 
acceleration. 
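
The total rates (4.9.19) and (4.9.20), and the absence of interference between the parallel and perpendicular components of a, can be checked by brute-force integration over the sphere. The Python sketch below is an illustration under stated assumptions: the function names and grid sizes are chosen here, units are c = 1, and the integrand is the radiation field of (4.9.12) squared, multiplied by ∂t/∂t_R = s/r, which gives (q²/4π) |r̂ × ((r̂ − v) × a)|² / (1 − r̂·v)⁵ per unit solid angle.

```python
import math


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def radiated_power(v, a, q=1.0, n_theta=800, n_phi=32):
    """Integrate dW/(dt_R dOmega) over all directions by the midpoint rule (c = 1)."""
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        sin_t, cos_t = math.sin(theta), math.cos(theta)
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            n = (sin_t * math.cos(phi), sin_t * math.sin(phi), cos_t)
            n_minus_v = (n[0] - v[0], n[1] - v[1], n[2] - v[2])
            e = cross(n, cross(n_minus_v, a))      # direction factor of E_rad
            total += dot(e, e) / (1.0 - dot(n, v)) ** 5 * sin_t
    return q * q / (4.0 * math.pi) * total * d_theta * d_phi


speed = 0.6
v = (0.0, 0.0, speed)
p_par = radiated_power(v, (0.0, 0.0, 1.0))    # a parallel to v
p_perp = radiated_power(v, (1.0, 0.0, 0.0))   # a perpendicular to v
p_both = radiated_power(v, (1.0, 0.0, 1.0))   # both components present
print(p_par, (2.0 / 3.0) / (1.0 - speed**2) ** 3)
print(p_perp, (2.0 / 3.0) / (1.0 - speed**2) ** 2)
print(p_both, p_par + p_perp)
```

Each printed pair should agree to the accuracy of the grid; the third pair illustrates that the cross term between the two components of a integrates to zero.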

We consider qualitatively one more topic in this section: radiation by 
a fast particle in a circular orbit, as in a cyclotron. Strictly, the spectrum 
is a line spectrum at the fundamental cyclotron frequency ω_0 plus 
overtones nω_0. Obviously, very high overtones will dominate, since the pulse 
of forward radiation sweeps rapidly by the observer. We reason as follows: 
As the radiation sweeps by the observer, it has an angular width [as we 
have seen in (4.9.16)] of order Δθ ~ √(1 − v). It sweeps by in time 

$$\Delta t \sim \frac{\Delta\theta}{d\theta/dt}. \tag{4.9.21}$$

However, it is not dθ/dt but dθ/dt_R that is controlled at the accelerator. 
Since dθ/dt_R = ω_0, 

$$\frac{d\theta}{dt} = \omega_0\,\frac{\partial t_R}{\partial t} = \frac{\omega_0}{1 - v\cos\theta},$$

and since θ ≪ 1, the observed frequency will be predominantly in the 
range 

$$\omega \sim \frac{1}{\Delta t} \sim \frac{\omega_0}{(1 - v)^{3/2}} \sim \frac{\omega_0}{(1 - v^2)^{3/2}}. \tag{4.9.22}$$


4.10. CERENKOV RADIATION 


A charged particle moving at constant velocity v in a medium in which 
c_M, the phase velocity of light, is smaller than v radiates energy. 

That something peculiar happens under these circumstances can be 
seen from the Liénard-Wiechert potentials, where the denominator 
1 − r̂·v/c_M is zero at an angle cos θ_c = c_M/v < 1, and the corresponding 
potentials become infinite. Of course, the singularity is not really there; 
it appears as a consequence of assuming a dielectric constant that is 
independent of frequency, so that there is no high-frequency cut-off. In 
practice, as ω → ∞, the dielectric constant ε → 1 and c_M → c > v, leaving 
the total radiation finite. This makes it clear that we must consider 
frequency-dependent dielectric constants ε(ω). We take μ, the magnetic 
permeability, equal to 1. 

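We consider a given frequency ω. Maxwell’s equations for the ωth 
components are⁸ 

$$\nabla\times\mathbf{B} = -\frac{i\omega}{c}\,\mathbf{D} + \frac{4\pi}{c}\,\mathbf{j}, \qquad \mathbf{D} = \epsilon\mathbf{E} \tag{4.10.1}$$

and 

$$\nabla\times\mathbf{E} = \frac{i\omega}{c}\,\mathbf{B} \tag{4.10.2}$$

which for ω ≠ 0 impose the constraint equations 

$$\nabla\cdot\mathbf{D} = 4\pi\rho \tag{4.10.3}$$

and 

$$\nabla\cdot\mathbf{B} = 0. \tag{4.10.4}$$

We introduce the potentials as usual: 

$$\mathbf{B} = \nabla\times\mathbf{A} \tag{4.10.5}$$

and 

$$\mathbf{E} = \frac{i\omega}{c}\,\mathbf{A} - \nabla\phi, \tag{4.10.6}$$

that, with (4.10.1) and (4.10.3), yield, for spatially constant ε, 

$$\nabla^2\mathbf{A} + \frac{\omega^2\epsilon}{c^2}\,\mathbf{A} = -\frac{4\pi}{c}\,\mathbf{j} + \nabla\left(\nabla\cdot\mathbf{A} - \frac{i\omega\epsilon}{c}\,\phi\right) \tag{4.10.7}$$

and 

$$\nabla^2\phi + \frac{\omega^2\epsilon}{c^2}\,\phi = -\frac{4\pi\rho}{\epsilon} + \frac{i\omega}{c}\left(\nabla\cdot\mathbf{A} - \frac{i\omega\epsilon}{c}\,\phi\right). \tag{4.10.8}$$

The Lorentz gauge here is evidently achieved by setting the terms in 
parentheses in (4.10.7) and (4.10.8) equal to zero. 

⁸For obvious reasons, we reinstate c in our equations. 

The charge and current densities are given by 

$$\rho_\omega(\mathbf{r}) = q\int_{-\infty}^{\infty} dt\,e^{i\omega t}\,\delta(z - vt)\,\delta^2(\boldsymbol{\rho}) \tag{4.10.9}$$

and 

$$\mathbf{j}_\omega(\mathbf{r}) = q\,\hat{\mathbf{e}}_z v\int_{-\infty}^{\infty} dt\,e^{i\omega t}\,\delta(z - vt)\,\delta^2(\boldsymbol{\rho}) \tag{4.10.10}$$

where ρ = ê_y y + ê_x x and q is the charge of the (point) particle. We have 
deliberately omitted the conventional factors 1/√(2π) from (4.10.9) and 
(4.10.10) to save writing. They are reinserted in (4.10.34). Please note 
that ρ (the charge density) and ρ (the radius in the x, y-plane) are totally 
disconnected entities. (We drop the ω subscript from now on.) 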

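Thus, 

$$\rho = \frac{q}{v}\,\exp\left(\frac{i\omega z}{v}\right)\delta^2(\boldsymbol{\rho}) \tag{4.10.11}$$

and 

$$\mathbf{j} = \hat{\mathbf{e}}_z\,q\,\exp\left(\frac{i\omega z}{v}\right)\delta^2(\boldsymbol{\rho}). \tag{4.10.12}$$

The vector and scalar potentials satisfy the equations 

$$(\nabla^2 + k^2)\,\mathbf{A} = -\frac{4\pi}{c}\,\hat{\mathbf{e}}_z\,q\,\delta^2(\boldsymbol{\rho})\,\exp\left(\frac{i\omega z}{v}\right) \tag{4.10.13}$$

and 

$$(\nabla^2 + k^2)\,\phi = -\frac{4\pi q}{\epsilon v}\,\delta^2(\boldsymbol{\rho})\,\exp\left(\frac{i\omega z}{v}\right) \tag{4.10.14}$$

with 

$$k^2 = \frac{\omega^2\epsilon}{c^2} = \frac{\omega^2}{c_M^2}. \tag{4.10.15}$$
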
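The solution of the equation 

$$(\nabla^2 + k^2)\,\psi = -4\pi\rho \tag{4.10.16}$$

corresponding to outgoing waves we have already seen by Fourier 
transformation of the retarded Green’s function. It is 

$$\psi = \int d\mathbf{r}'\,\frac{e^{ik|\mathbf{r} - \mathbf{r}'|}}{|\mathbf{r} - \mathbf{r}'|}\,\rho(\mathbf{r}'). \tag{4.10.17}$$

Solving (4.10.13) and (4.10.14) via (4.10.17), we find 

$$\mathbf{A} = \hat{\mathbf{e}}_z\,\frac{q}{c}\int \frac{e^{ik|\mathbf{r} - \mathbf{r}'|}}{|\mathbf{r} - \mathbf{r}'|}\,\delta^2(\boldsymbol{\rho}')\,\exp\left(\frac{i\omega z'}{v}\right)d\mathbf{r}' \tag{4.10.18}$$

and 

$$\phi = \frac{q}{v\epsilon}\int \frac{e^{ik|\mathbf{r} - \mathbf{r}'|}}{|\mathbf{r} - \mathbf{r}'|}\,\delta^2(\boldsymbol{\rho}')\,\exp\left(\frac{i\omega z'}{v}\right)d\mathbf{r}'. \tag{4.10.19}$$

We carry out the trivial ρ' integral and change variables to z' − z = 
ρu. There results 

$$\mathbf{A} = \hat{\mathbf{e}}_z\,\frac{q}{c}\,\exp\left(\frac{i\omega z}{v}\right)I \tag{4.10.20}$$

and 

$$\phi = \frac{q}{v\epsilon}\,\exp\left(\frac{i\omega z}{v}\right)I \tag{4.10.21}$$

where 

$$I = \int_{-\infty}^{\infty} du\,\frac{\exp\left\{ik\rho\left[(1 + u^2)^{1/2} + \dfrac{c_M}{v}\,u\right]\right\}}{(1 + u^2)^{1/2}}. \tag{4.10.22}$$
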


Since we are looking for radiation, we go to large ρ and approximate 
I by the method of stationary phase. That is, we look for the value of u, 
u_0, for which the phase of the exponential is stationary. If there is no such 
point, then the integral goes like 1/ρ for large ρ, whereas with the 
cylindrical geometry, fields must go like 1/ρ^{1/2} to radiate. The stationary point is 
given by 

$$\frac{\partial}{\partial u}\left[(1 + u^2)^{1/2} + \frac{c_M}{v}\,u\right]_{u_0} = 0 \tag{4.10.23}$$

or 

$$\frac{c_M}{v} = -\frac{u_0}{(1 + u_0^2)^{1/2}}. \tag{4.10.24}$$

Thus, to have a stationary point, we must have v > c_M, and 

$$u_0 = -\frac{c_M}{(v^2 - c_M^2)^{1/2}}. \tag{4.10.25}$$

Evidently, u_0 corresponds to the Cerenkov cone. Recall that 
z_0 − z = ρu_0, where z_0 is the point from which radiation emerges to arrive 
at ρ, z as shown in Figure 4.4. 

Figure 4.4. (Labels: θ_c, z_0, z, ρ.) 

Since 

$$\tan\theta_c = -\frac{1}{u_0},$$

$$\cos\theta_c = \frac{1}{\sqrt{1 + \dfrac{1}{u_0^2}}} = \frac{|u_0|}{\sqrt{1 + u_0^2}} = \frac{c_M}{v}$$

as expected. 


We expand the phase about u_0. Setting u = u_0 + s, we find for the 
integral 

$$I = \int ds\,\frac{\exp\left\{ik\rho\left[\dfrac{(v^2 - c_M^2)^{1/2}}{v} + \dfrac{s^2}{2}\,\dfrac{(v^2 - c_M^2)^{3/2}}{v^3} + O(s^3) + \cdots\right]\right\}}{[1 + u_0^2 + 2u_0 s + \cdots]^{1/2}}. \tag{4.10.26}$$

In the form (4.10.26), it is clear that only values of s of order of or less 
than 1/√ρ make significant contributions to I. Hence, we can drop the 
extra terms in the denominator and the ρs³ terms in the exponent; these 
give corrections of order 1/√ρ. The final answer is then 

$$I = \exp\left(ik\rho\,\frac{(v^2 - c_M^2)^{1/2}}{v}\right)\frac{(v^2 - c_M^2)^{1/2}}{v}\int ds\,\exp\left(ik\rho\,\frac{s^2}{2}\,\frac{(v^2 - c_M^2)^{3/2}}{v^3}\right), \tag{4.10.27}$$

or 

$$I = \frac{(v^2 - c_M^2)^{1/2}}{v}\,(1 + i)\left[\frac{\pi v^3}{k\rho\,(v^2 - c_M^2)^{3/2}}\right]^{1/2}\exp\left(ik\rho\,\frac{(v^2 - c_M^2)^{1/2}}{v}\right), \tag{4.10.28}$$

which is accurate to order 1/√ρ. 
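The final expressions for the potentials are 

$$\mathbf{A} = \hat{\mathbf{e}}_z\,\frac{q}{c}\,\exp\left(\frac{i\omega z}{v}\right)\exp\left(ik\rho\,\frac{(v^2 - c_M^2)^{1/2}}{v}\right)I_0 \tag{4.10.29}$$

and 

$$\phi = \frac{q}{v\epsilon}\,\exp\left(\frac{i\omega z}{v}\right)\exp\left(ik\rho\,\frac{(v^2 - c_M^2)^{1/2}}{v}\right)I_0 \tag{4.10.30}$$

with 

$$I_0 = (1 + i)\left[\frac{\pi v}{k\rho\,(v^2 - c_M^2)^{1/2}}\right]^{1/2}. \tag{4.10.31}$$

The fields are given in the ρ → ∞ limit by 

$$\mathbf{B} = \nabla\times\mathbf{A} = ik\,\frac{(v^2 - c_M^2)^{1/2}}{v}\,\hat{\boldsymbol{\rho}}\times\mathbf{A}$$

and 

$$\mathbf{E} = \frac{i\omega}{c}\,\mathbf{A} - \nabla\phi$$

or 

$$\mathbf{E} = \left[\frac{i\omega}{c}\,\frac{q}{c}\,\hat{\mathbf{e}}_z - i\left(\frac{\omega}{v}\,\hat{\mathbf{e}}_z + k\,\frac{(v^2 - c_M^2)^{1/2}}{v}\,\hat{\boldsymbol{\rho}}\right)\frac{q}{v\epsilon}\right]I_0\,e^{i\Phi} \tag{4.10.32}$$

$$= \frac{iq\omega}{c^2 v^2}\,(v^2 - c_M^2)^{1/2}\left[(v^2 - c_M^2)^{1/2}\,\hat{\mathbf{e}}_z - c_M\,\hat{\boldsymbol{\rho}}\right]I_0\,e^{i\Phi}, \tag{4.10.33}$$

where 

$$\Phi = \frac{\omega z}{v} + k\rho\,\frac{(v^2 - c_M^2)^{1/2}}{v}.$$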


The Poynting vector, integrated over time, is with our normalization 
of Fourier components⁹ 

$$\mathbf{P} = \frac{c}{4\pi}\int \frac{d\omega}{2\pi}\,\mathbf{E}^*\times\mathbf{B}. \tag{4.10.34}$$

The energy flux per unit length through a cylinder at radius ρ is then 
independent of ρ and equal to 

$$\frac{dW}{dz} = \frac{q^2}{2c^2}\int d\omega\,|\omega|\left(1 - \frac{c_M^2}{v^2}\right), \tag{4.10.35}$$

where the integration over ω is limited to values of ω for which 
c_M²/v² < 1. The absolute value |ω| comes about because the expression 
for |I_0|² has a term 1/k that must be interpreted as 1/|k|. 

Integrating over positive frequencies only, we may drop the factor 
of 1/2. The number of photons radiated per unit frequency and length is 
obtained by dividing by ħ|ω|: 

$$\frac{d^2N}{d\omega\,dz} = \frac{q^2}{\hbar c^2}\left(1 - \frac{c_M^2}{v^2}\right), \quad v > c_M$$

$$= 0, \quad v < c_M. \tag{4.10.36}$$
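
To get a feeling for the size of (4.10.36), the Python sketch below evaluates the stationary point (4.10.25), the cone angle, and the photon yield per unit length over a band of photon energies. It is an illustration under stated assumptions: the function names, the use of a refractive index n with c_M = c/n held constant over the band, the numerical value of ħc, and the example of a v ≈ c particle in a medium with n = 1.33 are all choices made here. With q = e, (4.10.36) integrated over dω = d(ħω)/ħ gives (α/ħc)(1 − c_M²/v²) Δ(ħω) photons per unit length, where α = e²/ħc.

```python
import math

ALPHA = 1.0 / 137.036             # e^2/(hbar c)
HBAR_C_EV_CM = 1.973269804e-5     # hbar*c in eV*cm


def stationary_point(v, c_m):
    """u_0 of (4.10.25); returns None when v <= c_M (no stationary point)."""
    if v <= c_m:
        return None
    return -c_m / math.sqrt(v * v - c_m * c_m)


def cherenkov_angle(v, c_m):
    """theta_c from tan(theta_c) = -1/u_0, equivalent to cos(theta_c) = c_M/v."""
    u0 = stationary_point(v, c_m)
    if u0 is None:
        return None
    return math.atan2(1.0, -u0)


def photons_per_cm(beta, n_index, e_min_ev, e_max_ev, alpha=ALPHA):
    """Integrate (4.10.36) over a photon-energy band, assuming constant n_index."""
    c_m_over_v = 1.0 / (n_index * beta)
    if c_m_over_v >= 1.0:
        return 0.0                # below threshold: no radiation
    return alpha / HBAR_C_EV_CM * (1.0 - c_m_over_v**2) * (e_max_ev - e_min_ev)


# Assumed example: v = c (units with c = 1), n = 1.33, photon energies 2 to 3 eV.
theta_c = cherenkov_angle(1.0, 1.0 / 1.33)
print(math.degrees(theta_c), photons_per_cm(1.0, 1.33, 2.0, 3.0))
```

The yield vanishes below the threshold v = c_M, in line with the second line of (4.10.36).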


⁹We carry out the calculation inside a dielectric cylinder. Since the tangential 
components of E and B are continuous at the dielectric boundary, the E × B flux through any 
cylinder correctly calculates the radiated energy. Remember that μ = 1. 


CHAPTER 4 PROBLEMS 


4.1. (a) Show that a function ψ that satisfies ∇²ψ = 0 in a region can 
have no maximum or minimum in the region. 
(b) From this, show that a finite function that satisfies ∇²ψ = 0 
everywhere and approaches zero as r → ∞ is zero everywhere. 
(c) From this, show that a vector field whose divergence and curl 
both vanish, and which approaches zero at ∞, is zero. 
(d) From this, show that a vector field that vanishes sufficiently 
rapidly and smoothly as r → ∞ can be written as the sum of a 
longitudinal field (with zero curl) and a transverse field with 
zero divergence: V = V_l + V_t. 
(e) For such a function, give a general integral formula for V_l and 
V_t as functions of ∇·V and ∇×V, and show the limiting 
behavior of V_l and V_t as r → ∞. Give sufficient conditions on the 
large r behavior for your results to hold. 


4.2. Verify explicitly that the E and B fields calculated from A and φ in 
the transverse gauge are equal (for all r) to those obtained from A 
and φ in the Lorentz gauge. 


4.3. Write an integral formula, analogous to (4.3.10-4.3.13) for the 1/r² 
correction to A and φ at large r. If the characteristic radius of the 
charge and current distribution is b, estimate the order of magnitude 
of the correction compared to the 1/r term. 


4.4. A unit point charge oscillates in one dimension with amplitude b 
and frequency ω_0: 

x = b cos ω_0 t. 

The charge density is a periodic function of time: 

ρ(x, t) = δ(x − b cos ω_0 t). 

Expand ρ(x, t) in a Fourier series: 

$$\rho(x, t) = \sum_{n=-\infty}^{\infty} a_n\,e^{in\omega_0 t}$$

and find a general formula for a_n. Check your algebra by calculating 
the monopole, dipole, and quadrupole amplitudes: 

$$M = \int_{-b}^{b} dx\,\delta(x - b\cos\omega_0 t) = 1,$$

$$D = \int_{-b}^{b} dx\,x\,\delta(x - b\cos\omega_0 t) = b\cos\omega_0 t,$$

$$Q = \int_{-b}^{b} dx\,x^2\,\delta(x - b\cos\omega_0 t) = b^2\cos^2\omega_0 t.$$


4.5. A point charge q oscillates along the z-axis: z = b cos ω_0 t, y = 0, 
x = 0. Consider radiation in the direction θ, φ (the usual spherical 
coordinates). Assuming ω_0 b/c ≪ 1, give the angular distribution of 
the radiated power: 

(a) At frequency ω_0. 
(b) At frequency 2ω_0. 


4.6. A small magnetic dipole rotates in the x-y plane following the 
formula 

M = ê_x cos ω_0 t + ê_y sin ω_0 t. 

Give the electric field radiated in the direction k, or at angle θ, φ. 

(a) Give the polarization state of the electric field for k_x = k_y = 0, 
k_z ≠ 0. 
(b) Do the same for k_z = 0, k_x, k_y ≠ 0. 
(c) Determine the angular distribution of power radiated. 


4.7. Calculate the retarded potentials of a point charge moving with 
uniform velocity v and show that the result is the same as obtained 
in (4.6.6) and (4.6.7). 


4.8. The rate of energy radiation by a slowly moving charged particle is 
given by 

$$\frac{dW}{dt} = \frac{2q^2}{3c^3}\left(\frac{d\mathbf{v}}{dt}\right)^2.$$

This energy must show up as a loss of energy by the radiating 
particle. Show that a radiation reaction force 

$$\mathbf{f}_r = \frac{2}{3}\,\frac{q^2}{c^3}\,\frac{d^2\mathbf{v}}{dt^2}$$

inserted in the equation of motion of a confined particle will account 
on the average for energy loss by the particle, as long as the velocity 
and acceleration of the particle are bounded. Show however that f_r 
inserted into the free particle equation of motion has unacceptable 
solutions. These are discussed in Section 5.9. 


4.9. Show directly that 

$$\nabla^2\left(\frac{f(t - r/c)}{r}\right) = -4\pi\,\delta^3(\mathbf{r})\,f(t) + \frac{1}{r}\,\frac{\partial^2 f(t - r/c)}{\partial r^2}$$

and hence that 

$$\left(\nabla^2 - \frac{1}{c^2}\,\frac{\partial^2}{\partial t^2}\right)\frac{f(t - r/c)}{r} = -4\pi\,\delta^3(\mathbf{r})\,f(t).$$


4.10. Consider the electromagnetic field in a vacuum inside of a perfectly 
conducting cavity or wave guide. With the vector potential in the 
transverse gauge, find the boundary conditions on the vector 
potential at the conducting wall. From this, find the normal modes of a 
rectangular perfectly conducting cavity with sides a, b, and c. 


4.11. A charge distribution oscillates according to the formula 


3x? r 


or.) =( 3 =) FO) cos wr, 


where F(r) → 0 rapidly as r → ∞. Give the angular distribution of 
the emitted radiation to lowest nonvanishing order in ωb/c, where 
b is the length scale of the charge distribution. 


4.12. A point electron of charge e moves in a given path r_e(t) = 
ê_1 a cos ωt + ê_2 b sin ωt (ê_1, ê_2 are orthogonal unit vectors). 

(a) Write formulas for the charge and current densities ρ(r', t) and 
j(r', t). 
(b) Write an exact integral formula for j_n(r'), where 

$$\mathbf{j}(\mathbf{r}', t) = \sum_n \mathbf{j}_n(\mathbf{r}')\,e^{-in\omega t}.$$

(c) Each current j_n now radiates a frequency ω_n = nω, with a 
corresponding wave number k_n = n(ω/c)r̂. The relevant amplitude j_{k_n} 
will be 

$$\mathbf{j}_{k_n} = \int e^{-i\mathbf{k}_n\cdot\mathbf{r}'}\,\mathbf{j}_n(\mathbf{r}')\,d\mathbf{r}'.$$

Evaluate the r' integral to obtain j_{k_n} expressed as a time integral 
over one period of the motion. 
(d) Do the final t integral for n = 0, 1, and 2, in each case to lowest 
nonvanishing order in ka and kb, where k = ω/c. 

From the n = 1 electric dipole vector potential, 

$$\mathbf{A}_1 = \mathbf{j}_{k_1}\,\frac{e^{i(kr - \omega t)}}{cr},$$

calculate: 
(e) The electric field (in terms of ω, a, b, k, etc.). 
(f) The same for the magnetic field. 
(g) The Poynting vector averaged over a cycle. 
(h) The polarization of the radiated light is normally elliptic. Are 
there one or more directions of observation k for which it is 
plane-polarized? If so, what are they? For which is it circularly 
polarized? If so, what are they? 


4.13. Two electrons, each with charge e, move oppositely along the x-axis 
with simple harmonic motion x_1 = a cos ωt, x_2 = −a cos ωt. Suppose 
ωa/c ≪ 1. Calculate to lowest order in ωa/c the radiated electric 
and magnetic fields and the angular distribution of radiated power. 


CHAPTER 5 


Scattering 


Almost all physics experiments can be described as scattering processes: 
We start with initial objects (fields or particles) approaching each other; 
we end with final objects separating. Among the processes we took up in 
Chapter 4 several can be thus described: For example, in bremsstrahlung, 
an initial charged particle approaches a target; a final charged particle 
emerges, accompanied by a radiated electromagnetic field. In Cerenkov 
radiation the target is the dielectric. In most of the other topics, the 
connection to scattering is less obvious, but it is still present. Therefore, 
scattering is important in physics, and it makes sense to treat it as a 
separate topic. This is true although no new principles are involved. 
Indeed, the reader may omit this entire chapter without experiencing any 
consequent difficulty in understanding the rest of the text. 

The author’s recommendation to the interested, but not devoted, 
reader is to compromise by omitting Sections 10-12. In the first six 
sections, we study the general theory of scattering, illustrated by the case 
of a scalar field (or in quantum theory a spin zero particle). Included is a 
discussion of partial wave amplitudes that decouple when the system 
being discussed has spherical symmetry. Sections 7-9 concern the general 
formulation of scattering of the electromagnetic field, with two simple 
applications to weak field scattering, by a harmonic oscillator and by a 
dielectric with ε − 1 ≪ 1. The remaining three sections, Sections 10-12, 
address the vector partial wave expansion and apply it to scattering by a 
dielectric sphere. This method is very important for numerical work in 
many cases where approximate methods are invalid. However, the discus- 
sion given here involves much more detailed algebra than the rest of the 
text and can be omitted easily in a first reading. 


5.1. SCALAR FIELD 


The electromagnetic field is most conveniently described by a vector 
potential $\mathbf{A}$ and the accompanying scalar potential $\phi$. It is called 
a vector field. We consider here first a theory that depends on a single 
scalar potential $\psi(\mathbf{x}, t)$, which we call a scalar field. Although there 
is no such field known in nature, the theory provides a simple model in 
which the mathematics and physics are more transparent than for the more 
realistic vector and tensor fields. Nevertheless, many of the essential 
physics elements that characterize the vector and tensor fields are present. 
It is only a minor complication to deal with a massive scalar field (quantum 
language; classically, we would say a field with a finite Compton 
wavelength), so we will do so. 

The wave equation for the field away from sources and scatterers 
(which we will always assume to be spatially confined) is taken to be 


$$\left(\nabla^2 - \mu^2 - \frac{\partial^2}{\partial t^2}\right)\psi(\mathbf{x}, t) = 0, \tag{5.1.1}$$


where $1/\mu$ is the Compton wavelength of the field. As before, $c = 1$. The 
form (5.1.1) is, of course, suggested by the corresponding equation for 
the components of the electromagnetic potentials in the Lorentz gauge. 
We include the term $\mu^2$ since that permits the particles associated with 
the quantum field to be massive, with mass $\mu_0 = \hbar\mu/c$. We will also see 
in Chapter 7 that (5.1.1) is the simplest nontrivial Lorentz invariant 
equation that we can write. 

In the presence of sources and scatterers, the right-hand side of (5.1.1) 
will be different from zero. However, in a scattering event, both the initial 
and final field configurations are far away from the sources, so that (5.1.1) 
is sufficient for our general discussion. 

The elementary, fixed wave number and fixed frequency solutions of 
(5.1.1) are 


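$$\phi_{\mathbf{k}}(\mathbf{x}, t) = e^{i(\mathbf{k}\cdot\mathbf{x} - \omega t)} \tag{5.1.2}$$


where $\omega = \sqrt{k^2 + \mu^2}$; conventionally, we call the minus sign in $e^{-i\omega t}$ 
positive frequency. Of course, if the field $\psi$ is real, $\psi$ must consist of a 
superposition of at least two of the elementary solutions, $\phi_{\mathbf{k}}$ and $\phi_{\mathbf{k}}^*$. 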

A conserved energy functional of the scalar field $\psi$, and a corresponding 
locally conserved energy density and energy flux, are permitted by 
(5.1.1).¹ The energy density is given in arbitrary units by 


$$u = \frac{1}{8\pi}\left[\left(\frac{\partial\psi}{\partial t}\right)^2 + (\nabla\psi)^2 + \mu^2\psi^2\right] \tag{5.1.3}$$


and the energy flux (the equivalent of the Poynting vector) by 


$$\mathbf{P} = -\frac{1}{4\pi}\frac{\partial\psi}{\partial t}\nabla\psi. \tag{5.1.4}$$


$u$ and $\mathbf{P}$ satisfy the conservation equation 


$$\frac{\partial u}{\partial t} + \nabla\cdot\mathbf{P} = 0 \tag{5.1.5}$$

leading to a conserved energy in a volume $V$: 


$$W_V = \int_V d\mathbf{x}\, u \tag{5.1.6}$$


provided there are no sources of $\psi$ inside the volume and the flux through 
the boundary surfaces is zero: 


$$\int_S \mathbf{P}\cdot d\mathbf{S} = 0. \tag{5.1.7}$$

¹We shall learn general rules for constructing such conserved quantities in Chapter 7. 


A scattering problem must specify an incident wave packet heading 
toward the target T, as shown in Figure 5.1. The vertical lines are meant 
to represent maxima of the amplitude within the envelope; thus, the 
distance between the lines is roughly $\lambda_0$, where $\lambda_0 = 2\pi/k_0$ is the mean 
wavelength of the incident field. The incident field $\psi_0(\mathbf{x}, t)$ is taken to be 


$$\psi_0(\mathbf{x}, t) = \int a(\mathbf{k} - \mathbf{k}_0)\, e^{i\mathbf{k}\cdot(\mathbf{x}-\mathbf{x}_0)}\, e^{-i\omega(t-t_0)}\, d\mathbf{k} + \text{c.c.} \tag{5.1.8}$$


Here, $\mathbf{k}_0$ is the central wave number of the packet; we refer to the 
scattering of this packet as the scattering at wave number $\mathbf{k}_0$, even though 
the packet involves a superposition of a continuum of wave numbers. For 
this terminology to make sense, the packet spread in wave numbers $\Delta k$ 
must be small compared to the characteristic wave number of the source; 
that is, $\Delta x \sim 1/\Delta k$ must be much larger than the source size, or force 
range. Note that this creates special problems for Coulomb scattering. 


Figure 5.1 


We choose $a(\mathbf{k} - \mathbf{k}_0)$ for simplicity to be a real, smooth function, 
symmetric under $\mathbf{k} - \mathbf{k}_0 \to -(\mathbf{k} - \mathbf{k}_0)$, with width $\Delta k$ as stated above. Then at 
time $t = t_0$ and with $\mathbf{q} = \mathbf{k} - \mathbf{k}_0$, 


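$$\psi_0(\mathbf{x}, t_0) = e^{i\mathbf{k}_0\cdot(\mathbf{x}-\mathbf{x}_0)}\, h(\mathbf{x} - \mathbf{x}_0) + \text{c.c.} \tag{5.1.9}$$


where 


$$h(\mathbf{x} - \mathbf{x}_0) = \int d\mathbf{q}\; a(\mathbf{q})\, e^{i\mathbf{q}\cdot(\mathbf{x}-\mathbf{x}_0)} \tag{5.1.10}$$


and $h$ is real and symmetric under the reflection of $\mathbf{x} - \mathbf{x}_0$. The mean 
value of $x_i$, defined by 


$$\bar{x}_i = \frac{\int d\mathbf{x}\; x_i\, |\psi_0(\mathbf{x}, t_0)|^2}{\int d\mathbf{x}\; |\psi_0(\mathbf{x}, t_0)|^2},$$


will be $x_{0i}$. The mean value of $k_i$, defined by 


$$\bar{k}_i = \frac{\int d\mathbf{k}\; k_i\, |a(\mathbf{k} - \mathbf{k}_0)|^2}{\int d\mathbf{k}\; |a(\mathbf{k} - \mathbf{k}_0)|^2},$$


will be $k_{0i}$. The root mean square spread in $x_i$ will be 


$$\Delta x_i = \left[\frac{\int d\mathbf{x}\,(x_i - x_{0i})^2\, |\psi_0(\mathbf{x}, t_0)|^2}{\int d\mathbf{x}\; |\psi_0(\mathbf{x}, t_0)|^2}\right]^{1/2} \quad \text{(no sum over } i\text{)}.$$


The root mean square spread in $k_i$ will be 


$$\Delta k_i = \left[\frac{\int d\mathbf{k}\,(k_i - k_{0i})^2\, |a(\mathbf{k} - \mathbf{k}_0)|^2}{\int d\mathbf{k}\; |a(\mathbf{k} - \mathbf{k}_0)|^2}\right]^{1/2} \quad \text{(no sum over } i\text{)}.$$


Of course $\Delta x_i$ and $\Delta k_i$ have the uncertainty property 


$$\Delta x_i\, \Delta k_i \ge \tfrac{1}{2}.$$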


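To study the time dependence of (5.1.8), we expand $\omega$ in powers of 
$\mathbf{q} = \mathbf{k} - \mathbf{k}_0$; thus, 


$$\omega(\mathbf{k}) = \omega(\mathbf{k}_0) + \mathbf{q}\cdot\nabla_{\mathbf{k}_0}\omega(\mathbf{k}_0) + \tfrac{1}{2}\,(\mathbf{q}\cdot\nabla_{\mathbf{k}_0})(\mathbf{q}\cdot\nabla_{\mathbf{k}_0})\,\omega(\mathbf{k}_0) + \cdots \tag{5.1.11}$$


and 


$$\psi_0(\mathbf{x}, t) = e^{i[\mathbf{k}_0\cdot(\mathbf{x}-\mathbf{x}_0) - \omega_0(t-t_0)]} \int d\mathbf{q}\; a(\mathbf{q}) \times \exp\!\Big\{ i\,\mathbf{q}\cdot[\mathbf{x} - \mathbf{x}_0 - \mathbf{v}_g(t-t_0)] - \tfrac{i}{2}(\mathbf{q}\cdot\nabla_{\mathbf{k}_0})^2\omega(\mathbf{k}_0)\,(t-t_0) + \cdots \Big\} + \text{c.c.} \tag{5.1.12}$$


where $\omega_0 = \omega(\mathbf{k}_0)$ and the group velocity is 


$$\mathbf{v}_g = \nabla_{\mathbf{k}_0}\omega(\mathbf{k}_0) = \frac{\mathbf{k}_0}{\omega(\mathbf{k}_0)}. \tag{5.1.13}$$


The last term in the exponent can be neglected if 


$$\frac{(\Delta k)^2}{\omega_0}\,(t - t_0) \sim \frac{(\Delta k)^2}{k_0}\, L \ll 1,$$

where $L$ is the distance we may allow the packet to travel between 
observations. We recall that $\Delta x \gtrsim 1/\Delta k \gg$ size of the target; hence, since 
$\Delta k/k_0 \ll 1$, we can always choose $L$ so that the initial (and final) distances 
to the target are much larger than the target size. Neglecting the last term 
in the exponent, we find for (5.1.12) 


$$\psi_0(\mathbf{x}, t) = e^{i[\mathbf{k}_0\cdot(\mathbf{x}-\mathbf{x}_0) - \omega_0(t-t_0)]}\, h(\mathbf{x} - \mathbf{x}_0 - \mathbf{v}_g(t - t_0)) + \text{c.c.} \tag{5.1.14}$$


$$= 2\cos\!\big(\mathbf{k}_0\cdot(\mathbf{x} - \mathbf{x}_0) - \omega_0(t - t_0)\big)\, h(\mathbf{x} - \mathbf{x}_0 - \mathbf{v}_g(t - t_0)). \tag{5.1.15}$$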
Thus, the packet envelope moves rigidly, without changing shape. 
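
The rigid motion of the envelope can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes one space dimension, a Gaussian $a(k - k_0)$, and a packet centred at $x_0 = 0$ at $t_0 = 0$; the function name `packet_peak` and all parameter values are illustrative choices. It sums the positive frequency plane waves with the exact $\omega = \sqrt{k^2 + \mu^2}$ and compares the position of the envelope maximum with $v_g t$.

```python
import cmath
import math

def packet_peak(mu=1.0, k0=5.0, dk=0.05, t=40.0):
    """Return (envelope peak position at time t, v_g * t) for a 1D packet
    that starts centred at x0 = 0 at t0 = 0 (illustrative assumptions)."""
    nk = 401
    # Sample wave numbers within 8 widths of the central value k0.
    ks = [k0 - 8 * dk + 16 * dk * j / (nk - 1) for j in range(nk)]
    # Real, symmetric amplitude a(k - k0); a Gaussian is assumed here.
    amps = [math.exp(-((k - k0) ** 2) / (2 * dk ** 2)) for k in ks]
    # Exact dispersion relation of the massive scalar field (c = 1).
    omegas = [math.sqrt(k * k + mu * mu) for k in ks]

    best_x, best_env = 0.0, -1.0
    for i in range(201):                      # scan x from 30 to 50
        x = 30.0 + 0.1 * i
        # Positive frequency part: sum of a(k) exp(i(kx - omega t)).
        psi_plus = sum(a * cmath.exp(1j * (k * x - w * t))
                       for a, k, w in zip(amps, ks, omegas))
        if abs(psi_plus) > best_env:          # |psi_plus| is the envelope |h|
            best_x, best_env = x, abs(psi_plus)

    v_group = k0 / math.sqrt(k0 * k0 + mu * mu)   # (5.1.13)
    return best_x, v_group * t
```

With these values $(\Delta k)^2 (t - t_0)/\omega_0$ is of order $10^{-2}$, so the neglected quadratic term is small and the two returned numbers should agree to within the grid spacing.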
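To find the energy incident per unit area, $dW/dA$, we must integrate 
the flux over time: 


$$\frac{dW}{dA} = -\frac{1}{4\pi}\int dt\; \frac{\partial\psi_0}{\partial t}\; \hat{\mathbf{k}}_0\cdot\nabla\psi_0. \tag{5.1.16}$$


We note that since $k_0 \gg 1/\Delta x$, the gradient in (5.1.16) acting on $h$ is 
negligible; similarly, since $\omega_0 = k_0/v_g \gg \Delta k\, v_g = (1/\Delta x)\,v_g \sim (1/h)\,\partial h/\partial t$, 
the time derivative in (5.1.16) acting on $h$ is negligible. Finally, since the 
envelope function $h$ varies negligibly in a period $1/\omega_0$, the integral in 
(5.1.16) averages the trigonometric function over time. There remains 


$$\frac{dW}{dA} = \frac{\omega_0 k_0}{2\pi}\int_{-\infty}^{\infty} dt\; [h(\mathbf{x} - \mathbf{x}_0 - \mathbf{v}_g(t - t_0))]^2. \tag{5.1.17}$$


If we take the target to be located at $\mathbf{x} = 0$, $\mathbf{x}_0$ and $\mathbf{v}_g$ must be parallel; 
otherwise, the packet will miss the target. Call that direction $z$. Then 


$$h(\mathbf{x} - \mathbf{x}_0 - \mathbf{v}_g(t - t_0)) = h(\boldsymbol{\rho},\, z - z_0 - v_g(t - t_0))$$


and 

$$\frac{dW}{dA} = \frac{\omega_0 k_0}{2\pi}\int dt\; h^2(\boldsymbol{\rho},\, z - z_0 - v_g(t - t_0)) = \frac{\omega_0 k_0}{2\pi v_g}\int dz\; h^2(\boldsymbol{\rho}, z) \tag{5.1.18}$$

where 


$$\boldsymbol{\rho} = \hat{\mathbf{e}}_x\, x + \hat{\mathbf{e}}_y\, y.$$


With $\boldsymbol{\rho}$ located at the target transverse coordinate, that is, $\boldsymbol{\rho} = 0$, we have 


$$\frac{dW}{dA} = \frac{\omega_0 k_0}{2\pi v_g}\int dz\; h^2(0, z) = \text{energy incident per unit area on the target}. \tag{5.1.19}$$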
A shorthand for obtaining the result (5.1.19) is to consider only the 
positive frequency part of $\psi_0$, $\psi_{0+}$, and then calculate the integrated flux 
as 


$$\frac{dW}{dA} = -2\,\mathrm{Re}\, \frac{1}{4\pi}\int dt\; \frac{\partial\psi_{0+}^*}{\partial t}\; \hat{\mathbf{k}}_0\cdot\nabla\psi_{0+}. \tag{5.1.20}$$


We shall use that procedure from now on. Thus, we consider in the 
following only the positive frequency part of $\psi$: 


$$\psi_+ = \int d\mathbf{k}\; a(\mathbf{k} - \mathbf{k}_0)\, e^{i[\mathbf{k}\cdot(\mathbf{x}-\mathbf{x}_0) - \omega(t-t_0)]} = e^{i[\mathbf{k}_0\cdot(\mathbf{x}-\mathbf{x}_0) - \omega_0(t-t_0)]} \times h(\mathbf{x} - \mathbf{x}_0 - \mathbf{v}_g(t - t_0)).$$


After the scattering is over, there will be an outgoing spherical wave 
$\psi_{sc}$ and the forward-going residue of the incoming field $\psi_0$. Figure 5.2 
illustrates the configuration. 

The wave field far from the source after the collision is given by 
the retarded Green’s function, $\Delta_R(\mathbf{x} - \mathbf{x}', t - t')$, acting on the source, 
whatever that source may be. $\Delta_R$ is the retarded solution of the 
inhomogeneous equation 


$$\left(\nabla^2 - \mu^2 - \frac{\partial^2}{\partial t^2}\right)\Delta_R = -4\pi\,\delta^3(\mathbf{x} - \mathbf{x}')\,\delta(t - t') \tag{5.1.21}$$


which is shown in the next section to be 


$$\Delta_R(\mathbf{r}, t) = \int \frac{d\omega}{2\pi}\; e^{-i\omega t}\, \Delta_\omega(r) \tag{5.1.22}$$


where 


$$\Delta_\omega(r) = \frac{e^{ikr}}{r} \tag{5.1.23}$$


with $k^2 = \omega^2 - \mu^2$, and $k/\omega > 0$ for $|\omega| > \mu$, $k = i\sqrt{\mu^2 - \omega^2}$ for $|\omega| < \mu$. 


Figure 5.2 


5.2. GREEN’S FUNCTION FOR MASSIVE SCALAR 
FIELD 


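The equation to be satisfied is (5.1.21): 


$$\left(\nabla^2 - \mu^2 - \frac{\partial^2}{\partial t^2}\right)\Delta_R(\mathbf{r}, t) = -4\pi\,\delta^3(\mathbf{r})\,\delta(t). \tag{5.1.21}$$


As in Section 4.2, we proceed by carrying out a four-dimensional 
Fourier transform: 


$$\Delta_R(\mathbf{r}, t) = \int \frac{d\mathbf{k}\, d\omega}{(2\pi)^4}\; e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}\, \tilde{\Delta}_R(\mathbf{k}, \omega) \tag{5.2.1}$$

and 

$$\delta^3(\mathbf{r})\,\delta(t) = \int \frac{d\mathbf{k}\, d\omega}{(2\pi)^4}\; e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}. \tag{5.2.2}$$

There results 


$$\tilde{\Delta}_R(\mathbf{k}, \omega) = -\frac{4\pi}{\omega^2 - \mu^2 - k^2} \tag{5.2.3}$$

and 

$$\Delta_R(\mathbf{r}, t) = \lim_{\epsilon\to 0+}\; -4\pi \int \frac{d\mathbf{k}\, d\omega}{(2\pi)^4}\; \frac{e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}}{(\omega + i\epsilon)^2 - \mu^2 - k^2} \tag{5.2.4}$$


where the $\epsilon \to 0+$ limit ensures a retarded solution. 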

Unfortunately, the integral in (5.2.4) leads to a more complicated 
function than the zero mass case. However, the $\omega$ Fourier transform is 
very simple. Thus, instead of integrating over $\omega$, we integrate over $\mathbf{k}$: 


$$\Delta_R(\mathbf{r}, t) = -\frac{1}{\pi^2 r}\lim_{\epsilon\to 0+}\int_{-\infty}^{\infty} d\omega\; e^{-i\omega t}\int_0^{\infty} \frac{k\, dk\, \sin kr}{(\omega + i\epsilon)^2 - \mu^2 - k^2}$$

$$= \lim_{\epsilon\to 0+}\frac{1}{2\pi^2 r i}\int_{-\infty}^{\infty} d\omega\; e^{-i\omega t}\int_{-\infty}^{\infty} \frac{k\, dk\; e^{ikr}}{k^2 + \mu^2 - (\omega + i\epsilon)^2}. \tag{5.2.5}$$


We proceed by carrying out the $k$ integration. The integrand has poles at 

$$k = \sqrt{(\omega + i\epsilon)^2 - \mu^2} = \sqrt{\omega^2 - \mu^2} + i\epsilon'\omega$$


and at $k = -(\sqrt{\omega^2 - \mu^2} + i\epsilon'\omega)$, where the infinitesimal $\epsilon'$ has the same 
sign as $\epsilon$. Since we intend to close the contour above, only poles in the 
upper half-plane will contribute. These are, for $\omega > \mu$, $k = \sqrt{\omega^2 - \mu^2}$; for 
$\omega < -\mu$, $k = -\sqrt{\omega^2 - \mu^2}$; and for $-\mu < \omega < \mu$, $k = i\sqrt{\mu^2 - \omega^2}$. The end 
result is 


$$\Delta_R(\mathbf{r}, t) = \frac{1}{r}\int_{-\infty}^{\infty} \frac{d\omega}{2\pi}\; e^{i(kr - \omega t)} \tag{5.2.6}$$


where $k = (\omega^2 - \mu^2)^{1/2}$ for $\omega > \mu$, $k = -(\omega^2 - \mu^2)^{1/2}$ for $\omega < -\mu$, and 
$k = i(\mu^2 - \omega^2)^{1/2}$ for $-\mu < \omega < \mu$. The function $k$ so defined is analytic 
in the entire $\omega$ plane except for branch points at $\omega = \pm\mu$. The cut is taken 
between the branch points; the definition of $k$ informs us that the $\omega$ 
integral goes above the cut. 

We list a few properties of (5.2.6). First note that for $\mu = 0$, $\Delta_R = 
(1/r)\,\delta(t - r)$, as it must. 

Second, for $r > t$, including all $t < 0$, the integral vanishes. This is 
shown by closing the $\omega$ contour in the upper half-plane. 

Finally, we note that for $t > r$, $\Delta_R$ is nonzero and not particularly 
simple. 

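We are now in a position to discuss the scattering as a radiation of a 
scalar field by a source $S(\mathbf{x}, t)$. Equation (5.1.1) becomes 


$$\left(\nabla^2 - \frac{\partial^2}{\partial t^2} - \mu^2\right)\psi = -4\pi S \tag{5.2.7}$$


where $S$ may be a given source, in which case we could study the radiation 
from the source; or if we are considering scattering of the field by a 
potential, the source $S$ would be a linear function of the field itself, and 
(5.2.7) would become a Schrödinger-like equation for the wave function 
(field amplitude). 

The retarded propagation problem posed by (5.2.7) is solved by 
integration: 


$$\psi(\mathbf{x}, t) = \psi_0(\mathbf{x}, t) + \int dt'\, d\mathbf{x}'\; \Delta_R(\mathbf{x} - \mathbf{x}', t - t')\, S(\mathbf{x}', t'), \tag{5.2.8}$$


where $\psi_0$ satisfies the free equation (5.1.1). In a scattering problem, $\psi_0$ 
would describe the incident wave. 
Suppose now that $S(\mathbf{x}', t')$ contains only one frequency, so that 

$$S(\mathbf{x}', t') = e^{-i\omega t'}\, \mathcal{S}(\mathbf{x}'),$$

where the script letter $\mathcal{S}$ denotes the time-independent amplitude of the source. 


Assigning the same frequency $\omega$ to $\psi$ and $\psi_0$, we have the result 


$$\psi = \phi(\mathbf{x})\, e^{-i\omega t}, \qquad \psi_0 = \phi_0(\mathbf{x})\, e^{-i\omega t}, \tag{5.2.9}$$

and 

$$\phi(\mathbf{x})\, e^{-i\omega t} = \phi_0(\mathbf{x})\, e^{-i\omega t} + \int dt'\, d\mathbf{x}'\, \frac{d\omega'}{2\pi}\; \frac{e^{-i\omega'(t - t') + ik'|\mathbf{x} - \mathbf{x}'|}}{|\mathbf{x} - \mathbf{x}'|}\; \mathcal{S}(\mathbf{x}')\, e^{-i\omega t'}$$

or 

$$\phi(\mathbf{x}) = \phi_0(\mathbf{x}) + \int d\mathbf{x}'\; \frac{e^{ik|\mathbf{x} - \mathbf{x}'|}}{|\mathbf{x} - \mathbf{x}'|}\; \mathcal{S}(\mathbf{x}'). \tag{5.2.10}$$


At this point, we have made contact with the work on electromagnetic 
radiation in Chapter 4; in that case, there was no incoming field $\phi_0$, and 
the source $S(\mathbf{x}, t)$ consisted of given charge and current densities. The 
field at distant $\mathbf{x}$ was given by replacing 

$$\frac{e^{ik|\mathbf{x} - \mathbf{x}'|}}{|\mathbf{x} - \mathbf{x}'|} \quad\text{by}\quad \frac{e^{ikr}}{r}\, e^{-ik\hat{\mathbf{r}}\cdot\mathbf{x}'},$$


accurate to order $1/r$. Evidently, the same expansion works for radiation 
of the $\psi$ field by a given source. In a scattering process, the source is 
affected by the incoming field. 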
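
As a quick numerical illustration of the large-$r$ replacement above, the sketch below compares the exact kernel $e^{ik|\mathbf{x} - \mathbf{x}'|}/|\mathbf{x} - \mathbf{x}'|$ with $(e^{ikr}/r)\,e^{-ik\hat{\mathbf{r}}\cdot\mathbf{x}'}$ for one source point. The function name `far_field_error`, the source point, the observation direction, and the wave number are all illustrative assumptions, not values from the text. The relative discrepancy is expected to fall off like $1/r$.

```python
import cmath
import math

def far_field_error(r, k=2.0, xp=(0.3, -0.2, 0.4), direction=(0.0, 0.6, 0.8)):
    """Relative difference between the exact kernel and its far-field form.

    r         : distance of the observation point from the origin
    xp        : source point x' (assumed, of order the source size)
    direction : unit vector r-hat toward the observation point (assumed)
    """
    x = tuple(r * n for n in direction)
    dist = math.dist(x, xp)                       # |x - x'|
    exact = cmath.exp(1j * k * dist) / dist
    n_dot_xp = sum(n * c for n, c in zip(direction, xp))
    approx = cmath.exp(1j * k * r) / r * cmath.exp(-1j * k * n_dot_xp)
    return abs(exact - approx) / abs(exact)

# Example: the error shrinks roughly tenfold for each tenfold increase in r.
errors = [far_field_error(r) for r in (10.0, 100.0, 1000.0)]
```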


5.3. FORMULATION OF THE SCATTERING PROBLEM 


In a scattering problem, $\psi_0$ would specify the incoming field: position, 
velocity, wave number and shape of the packet, as described in Section 
5.1. In a linear system, the source $S$ would have its frequency determined 
by the frequency of $\psi_0$. A Schrödinger-like model, for example, would 
have the wave equation 


$$\left[\nabla^2 - (\mu + U)^2 - \frac{\partial^2}{\partial t^2}\right]\psi = 0 \tag{5.3.1}$$


where $\mu$ is the particle mass. For $U \ll \mu$ and the frequency $E = \mu + W$, 
$W \ll \mu$, (5.3.1) becomes 


$$\left(-\frac{\nabla^2}{2\mu} + U\right)\psi = W\psi \tag{5.3.2}$$


which is the Schrödinger equation. The source $S(\mathbf{x})$ from (5.3.1) is 


$$S = -\frac{2\mu U + U^2}{4\pi}\,\psi \tag{5.3.3}$$


and (5.2.10) becomes an integral equation for $\phi$. 

The exponential dependence $e^{ikr}$ is called an outgoing wave (remember 
that the time-dependent factor $e^{-i\omega t}$ is appended to the wave function 
and that $k$ has the same sign as $\omega$; hence, an outgoing wave). 

The standard procedure is to solve the integral equation (5.2.10) for 
an incident plane wave, $\phi_0 = e^{i\mathbf{k}\cdot\mathbf{x}}$. In principle, one would then construct 
a wave packet superposition of the plane waves, as described in Section 
5.1. In practice, as we shall see, the calculated cross section is substantially 
independent of the structure of the wave packet. 
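We proceed as follows: The asymptotic ($r \to \infty$) solution of (5.2.10) is 


$$\phi(\mathbf{x}) = e^{i\mathbf{k}\cdot\mathbf{x}} + \frac{e^{ikr}}{r}\, f(\hat{\mathbf{r}}, k) + O\!\left(\frac{1}{r^2}\right). \tag{5.3.4}$$


The function $f(\hat{\mathbf{r}}, k)$ is called the elastic scattering amplitude at wave 
number $k$ and direction $\hat{\mathbf{r}}$. 


The wave packet superposition is given, as in (5.1.8), by 

$$\psi_0(\mathbf{x}, t) = \int d\mathbf{k}\; e^{i[\mathbf{k}\cdot(\mathbf{x}-\mathbf{x}_0) - \omega(t - t_0)]}\, a(\mathbf{k} - \mathbf{k}_0) + \text{c.c.}$$


We have seen in (5.1.15) that, for times of interest, the wave packet 
moves rigidly without changing shape: 


$$\psi_0(\mathbf{x}, t) \cong e^{i[\mathbf{k}_0\cdot(\mathbf{x}-\mathbf{x}_0) - \omega_0(t - t_0)]}\, h(\mathbf{x} - \mathbf{x}_0 - \mathbf{v}_g(t - t_0)) + \text{c.c.}, \tag{5.3.5}$$


and that the total energy incident per unit area at the target ($\boldsymbol{\rho} = 0$) is 

$$\frac{dW}{dA} = -\frac{1}{4\pi}\int dt\; \frac{\partial\psi_0}{\partial t}\frac{\partial\psi_0}{\partial z} = \frac{\omega_0 k_0}{2\pi v_g}\int |h(0, z)|^2\, dz, \tag{5.1.19}$$


provided $S \to 0$ rapidly enough as $r \to \infty$. We assume that to be the case, 
here and in the following. 


The scattered wave packet will be given by the superposition of 
outgoing waves: 


$$\psi_{sc} = \int d\mathbf{k}\; \frac{e^{i[kr - \mathbf{k}\cdot\mathbf{x}_0 - \omega(t - t_0)]}}{r}\; a(\mathbf{k} - \mathbf{k}_0)\, f(\hat{\mathbf{r}}, k) + \text{c.c.} \tag{5.3.6}$$


We carry out the same expansion about $\mathbf{k} = \mathbf{k}_0$ for $\psi_{sc}$ as we did for $\psi_0$. 
The difference appears in the expansion of the magnitude $k$ vs. that of the vector $\mathbf{k}$: 


$$k = \sqrt{\mathbf{k}^2} = \sqrt{(\mathbf{k}_0 + \mathbf{k} - \mathbf{k}_0)^2} = \sqrt{k_0^2 + 2\mathbf{k}_0\cdot(\mathbf{k} - \mathbf{k}_0) + (\mathbf{k} - \mathbf{k}_0)^2} \tag{5.3.7}$$


$$= k_0\left[1 + \frac{\mathbf{k}_0\cdot(\mathbf{k} - \mathbf{k}_0)}{k_0^2} + O\!\left(\left(\frac{\Delta k}{k_0}\right)^2\right)\right] \tag{5.3.8}$$


$$= k_0 + \hat{\mathbf{k}}_0\cdot(\mathbf{k} - \mathbf{k}_0) + O\!\left(\frac{(\Delta k)^2}{k_0}\right). \tag{5.3.9}$$


We neglect the quadratic term in (5.3.9), as we did earlier in (5.1.12). 
Note that the neglected term here is $[(\Delta k)^2/k_0]\, r$; in (5.1.13) it was the 
equivalent expression $[(\Delta k)^2/\omega_0](t - t_0)$. We also assume that $f(\hat{\mathbf{r}}, k)$ varies 
negligibly over the width of the wave packet. This is consistent with our 
earlier assumption that the coordinate spread in the wave packet be much 
larger than the size of the target. The result for $\psi_{sc}$ is 


$$\psi_{sc} = h(\hat{\mathbf{k}}_0 r - \mathbf{x}_0 - \mathbf{v}_g(t - t_0))\; \frac{f(\hat{\mathbf{r}}, k_0)}{r}\; e^{i[k_0 r - \mathbf{k}_0\cdot\mathbf{x}_0 - \omega_0(t - t_0)]} + \text{c.c.} \tag{5.3.10}$$


Note the role of the wave packet function $h$. Since $\mathbf{k}_0$, $\mathbf{x}_0$, and $\mathbf{v}_g = 
\mathbf{k}_0/\omega_0$ are all in the same $z$ direction, the function $h$ is evaluated at $\boldsymbol{\rho} = 
0$, just as in the incident packet. The outgoing integrated energy flux per 
unit area is 


$$\int \mathbf{P}_{sc}\cdot\hat{\mathbf{r}}\; dt = \frac{|f(\hat{\mathbf{r}}, k_0)|^2}{r^2}\, \frac{\omega_0 k_0}{2\pi}\int dt\; |h(0,\, r - z_0 - v_g(t - t_0))|^2$$

$$= \frac{|f(\hat{\mathbf{r}}, k_0)|^2}{r^2}\, \frac{\omega_0 k_0}{2\pi v_g}\int |h(0, z)|^2\, dz. \tag{5.3.11}$$


The cross section $d\sigma$ for scattering into a solid angle $d\Omega$ is defined as the 
ratio of the energy scattered into $d\Omega$ divided by the energy incident per 
unit area on the target. The differential cross section $d\sigma/d\Omega$ is defined by 
the equation $d\sigma = (d\sigma/d\Omega)\, d\Omega$ (note that $d\sigma/d\Omega$ is not a derivative!), 
so that 


$$d\sigma = \frac{d\sigma}{d\Omega}\, d\Omega = \frac{\int \mathbf{P}_{sc}\cdot\hat{\mathbf{r}}\; dt\; r^2\, d\Omega}{\int \mathbf{P}_0\cdot\hat{\mathbf{k}}_0\; dt} = d\Omega\, |f(\hat{\mathbf{r}}, k_0)|^2. \tag{5.3.12}$$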
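Thus, the scattering problem can be stated in two ways: 


1. Solve the integral equation 


$$\phi(\mathbf{x}) = e^{i\mathbf{k}\cdot\mathbf{x}} + \int \frac{e^{ik|\mathbf{x} - \mathbf{x}'|}}{|\mathbf{x} - \mathbf{x}'|}\; \mathcal{S}(\mathbf{x}')\, d\mathbf{x}' \tag{5.3.13}$$


with $\mathcal{S}$ a linear function of $\phi$ as in (5.3.3), with the time-independent 
amplitude $\mathcal{S}$ replacing $S$ and $\phi$ replacing $\psi$; from $\phi(\mathbf{x})$ determine $\mathcal{S}$, 
and from $\mathcal{S}$ calculate the scattering amplitude 


$$f(\hat{\mathbf{r}}, k) = \int e^{-ik\hat{\mathbf{r}}\cdot\mathbf{x}'}\; \mathcal{S}(\mathbf{x}')\, d\mathbf{x}'. \tag{5.3.14}$$


Or, equivalently, 
2. Solve the differential equation 


$$(\nabla^2 - \mu^2 + \omega^2)\,\phi(\mathbf{x}) = -4\pi\,\mathcal{S}(\mathbf{x}) \tag{5.3.15}$$


subject to the boundary condition 


$$\phi \xrightarrow[r\to\infty]{} e^{i\mathbf{k}\cdot\mathbf{x}} + \frac{e^{ikr}}{r}\, f(\hat{\mathbf{r}}, k);$$


from the solution determine the function $f$, which is the scattering 
amplitude. 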


5.4. THE OPTICAL THEOREM 


An important theorem, applied to elastic scattering, relates the imaginary 
part of the forward ($\theta = 0$) scattering amplitude to the total elastic 
scattering cross section: 


$$\mathrm{Im}\, f(\theta = 0) = \frac{k}{4\pi}\int \frac{d\sigma}{d\Omega}\, d\Omega. \tag{5.4.1}$$


In fact, the theorem is more general. On the right of (5.4.1) in the 
case where there is absorption of energy, there should be the total instead 
of the elastic cross section. That is, the general theorem says 


$$\mathrm{Im}\, f(\theta = 0) = \frac{k}{4\pi}\,\sigma_T \tag{5.4.2}$$


where $f(\theta = 0)$ is the forward elastic scattering amplitude and $\sigma_T$ the total 
cross section. 

In the case of vector (or higher) fields, forward signifies not only zero 
deflection angle $\theta$, but identical polarization to that of the incoming field. 
We will see this explicitly when we discuss scattering of electric and 
magnetic fields. 
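The theorem follows directly from the energy conservation equation 
(5.1.5) 


$$\nabla\cdot\mathbf{P} + \frac{\partial u}{\partial t} = 0 \tag{5.1.5}$$

or 


$$\int_S \mathbf{P}\cdot d\mathbf{S} = -\frac{d}{dt}\int_V u\, d\mathbf{x}. \tag{5.4.3}$$


If we consider a given frequency 


$$\psi = \psi_+ + \psi_-$$

with 


$$\psi_+ = e^{-i\omega t}\,\phi \quad\text{and}\quad \psi_- = e^{i\omega t}\,\phi^*,$$


then the time average over a cycle of (5.4.3) tells us that 

$$\int_S \bar{\mathbf{P}}\cdot d\mathbf{S} = 0 \tag{5.4.4}$$

since $u$ is periodic in $t$ with period $\pi/\omega$. On the other hand, 

$$\mathbf{P} = -\frac{1}{4\pi}\,(\dot\psi_+ + \dot\psi_-)\,\nabla(\psi_+ + \psi_-) \tag{5.4.5}$$

and the time average $\bar{\mathbf{P}}$ is 

$$\bar{\mathbf{P}} = -\frac{1}{4\pi}\,(\dot\psi_+\nabla\psi_- + \dot\psi_-\nabla\psi_+) \tag{5.4.6}$$

so that energy conservation takes the form 

$$\frac{4\pi}{\omega}\int_S \bar{\mathbf{P}}\cdot d\mathbf{S} = -i\int_S \phi^*\,\nabla\phi\cdot d\mathbf{S} + \text{c.c.} = 0. \tag{5.4.7}$$


Equation (5.4.7) is equivalent to the optical theorem. To proceed, we 
note that in the surface integral above, terms of the form 


$$\int_{-1}^{1} e^{\pm ikrw}\, dw\; g(r, w),$$


following an integration by parts acquire an extra power of $1/r$. Here, $w = 
\cos\theta$, and $g$ is assumed to be free of singularities in the physical region, 
$-1 \le w \le 1$. It follows that, as $r \to \infty$, we may write 


$$\nabla\phi \to i\mathbf{k}\, e^{i\mathbf{k}\cdot\mathbf{x}} + ik\,\hat{\mathbf{r}}\,\frac{e^{ikr}}{r}\, f + O\!\left(\frac{1}{r^2}\right) \tag{5.4.8}$$


and ignore the $1/r^2$ term. Inserting (5.4.8) into (5.4.7) gives, as $r \to \infty$, 
accurate to order 1, 


$$-\int d\mathbf{S}\cdot i\left[e^{-i\mathbf{k}\cdot\mathbf{x}} + f^*\,\frac{e^{-ikr}}{r}\right]\left[i\mathbf{k}\, e^{i\mathbf{k}\cdot\mathbf{x}} + ik\,\hat{\mathbf{r}}\,\frac{e^{ikr}}{r}\, f\right] + \text{c.c.} = 0. \tag{5.4.9}$$


With $d\mathbf{S} = r^2\,\hat{\mathbf{r}}\, d\Omega$ (we are, of course, integrating over a sphere), (5.4.9) 
becomes, after dividing by $r^2$ and keeping terms to order $1/r^2$, 


$$\int d\Omega\left[\hat{\mathbf{r}}\cdot\mathbf{k} + \frac{k}{r}\, e^{i(kr - \mathbf{k}\cdot\mathbf{x})}\, f + \frac{\hat{\mathbf{r}}\cdot\mathbf{k}}{r}\, e^{-i(kr - \mathbf{k}\cdot\mathbf{x})}\, f^* + \frac{k}{r^2}\,|f|^2\right] + \text{c.c.} = 0. \tag{5.4.10}$$


The first integral is zero. In the second integral, we integrate by parts: 


$$\int d\Omega\; e^{-i\mathbf{k}\cdot\mathbf{x}}\, f = \int_0^{2\pi} d\varphi\int_{-1}^{1} dw\; e^{-ikrw}\, f(w, \varphi) \tag{5.4.11}$$


where $w = \cos\theta$. 
Let $e^{-ikrw}\, dw = dv$; $u = f$. Thus, integrating by parts, we obtain, to 
leading order in $1/r$, 


$$\int_{-1}^{1} dw\; e^{-ikrw}\, f(w, \varphi) = -\frac{e^{-ikrw}}{ikr}\, f(w, \varphi)\Big|_{w=-1}^{w=1} + O\!\left(\frac{1}{r^2}\right) = -\frac{1}{ikr}\left[e^{-ikr} f(1, \varphi) - e^{ikr} f(-1, \varphi)\right] + O\!\left(\frac{1}{r^2}\right). \tag{5.4.12}$$


Since at $\theta = 0$ or $\pi$ ($w = 1$ or $-1$), $\varphi$ dependence must disappear, we have 
for (5.4.11) 


$$\int d\Omega\; e^{-i\mathbf{k}\cdot\mathbf{x}}\, f = -\frac{2\pi}{ikr}\left(e^{-ikr} f(\theta = 0) - e^{ikr} f(\theta = \pi)\right) + O\!\left(\frac{1}{r^2}\right). \tag{5.4.13}$$

Similarly, 

$$\int d\Omega\; \hat{\mathbf{r}}\cdot\mathbf{k}\; e^{i\mathbf{k}\cdot\mathbf{x}}\, f^* = \frac{2\pi}{ir}\left(e^{ikr} f^*(\theta = 0) + e^{-ikr} f^*(\theta = \pi)\right) + O\!\left(\frac{1}{r^2}\right). \tag{5.4.14}$$


Inserting (5.4.13) and (5.4.14) into (5.4.10), we have 


$$-\frac{2\pi f(\theta = 0)}{ir^2} + \frac{2\pi f^*(\theta = 0)}{ir^2} + \frac{2\pi}{ir^2}\left[f(\theta = \pi)\, e^{2ikr} + f^*(\theta = \pi)\, e^{-2ikr}\right] + \frac{k}{r^2}\int |f|^2\, d\Omega + \text{c.c.} = 0. \tag{5.4.15}$$


The contribution from $\theta = \pi$ is imaginary and cancels against its complex 
conjugate. This leaves, after we add the complex conjugate, 


$$\mathrm{Im}\, f(\theta = 0) = \frac{k}{4\pi}\,\sigma, \tag{5.4.16}$$

as expected, with $\sigma$ the elastic cross section, 


$$\sigma = \int \frac{d\sigma}{d\Omega}\, d\Omega = \int |f|^2\, d\Omega. \tag{5.4.17}$$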




5.5. DIGRESSION ON RADIAL WAVE FUNCTIONS 


We are about to take up the description of scattering in a system possessing 
spherical symmetry. For such a system, the angular dependence of the 
wave functions and scattering amplitudes can be expanded in a series of 
spherical harmonics. The coefficients of the spherical harmonics are called 
partial wave amplitudes. For a wave outside of the region of interaction, 
these partial wave amplitudes involve a specific set of radial functions 
called spherical Bessel functions. Since these functions appear in a large 
class of applications, we treat them in a separate section. 
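We first study the Green’s function 


$$\Delta_R(\mathbf{r} - \mathbf{r}') = \frac{e^{ik|\mathbf{r} - \mathbf{r}'|}}{|\mathbf{r} - \mathbf{r}'|} \tag{5.5.1}$$

for $r > r'$. As $r \to \infty$, we know the leading term is 

$$\Delta_R(\mathbf{r} - \mathbf{r}') \to \frac{e^{ikr}}{r}\, e^{-i\mathbf{k}\cdot\mathbf{r}'} \quad\text{with}\quad \mathbf{k} = k\hat{\mathbf{r}}. \tag{5.5.2}$$


We note that for arbitrary $\mathbf{r}$, $\mathbf{r}'$, but $\mathbf{r} \ne \mathbf{r}'$, $\Delta_R$ satisfies the homogeneous 
wave equation 


$$(\nabla^2 + k^2)\,\Delta_R = 0 \tag{5.5.3}$$

as well as 

$$(\nabla'^2 + k^2)\,\Delta_R = 0. \tag{5.5.4}$$


We expand $\Delta_R(r, r', w = \cos\theta)$ in Legendre polynomials: 


$$\Delta_R(\mathbf{r} - \mathbf{r}') = \sum_\ell P_\ell(w)\, g_\ell(r, r') \tag{5.5.5}$$

where now from (5.5.3) 


$$\left[\frac{\partial^2}{\partial r^2} + \frac{2}{r}\frac{\partial}{\partial r} + k^2 - \frac{\ell(\ell + 1)}{r^2}\right] g_\ell = 0 \tag{5.5.6}$$

and from (5.5.4) 


$$\left[\frac{\partial^2}{\partial r'^2} + \frac{2}{r'}\frac{\partial}{\partial r'} + k^2 - \frac{\ell(\ell + 1)}{r'^2}\right] g_\ell = 0 \tag{5.5.7}$$

but where 

$$g_\ell \to \frac{e^{ikr}}{r}\, f_\ell(r') \tag{5.5.8}$$

as $r \to \infty$, and 

$$f_\ell(r') = \frac{2\ell + 1}{2}\int_{-1}^{1} dw\; P_\ell(w)\, e^{-ikr'w}. \tag{5.5.9}$$

The solution of (5.5.6) subject to the boundary condition that it 
approach $(1/i^{\ell+1})(e^{ikr}/kr)$ for large $kr$ is called $h_\ell(kr)$ (spherical Bessel 
function of the third kind). Hence, 


$$g_\ell = k\, i^{\ell+1}\, h_\ell(kr)\cdot\frac{2\ell + 1}{2}\int_{-1}^{1} dw\; P_\ell(w)\, e^{-ikr'w}. \tag{5.5.10}$$


The $dw$ integral, for large $r'$, can be estimated by integrating by parts: 
$e^{-ikr'w}\, dw = dv$, $P_\ell(w) = u$, so 

$$\int_{-1}^{1} dw\; P_\ell(w)\, e^{-ikr'w} = P_\ell(w)\,\frac{e^{-ikr'w}}{-ikr'}\bigg|_{-1}^{1} + O\!\left(\frac{1}{r'^2}\right) = \frac{2(-i)^\ell}{kr'}\,\sin\!\left(kr' - \frac{\ell\pi}{2}\right) + O\!\left(\frac{1}{r'^2}\right).$$


The solution of (5.5.7) that approaches $\sin[kr' - (\ell\pi/2)]/kr'$ as $kr' \to \infty$ 
is called $j_\ell(kr')$ (spherical Bessel function of the first kind). Thus, 


$$g_\ell = i(2\ell + 1)\, k\, h_\ell(kr)\, j_\ell(kr') \tag{5.5.11}$$

and, for $r > r'$, 

$$\frac{e^{ik|\mathbf{r} - \mathbf{r}'|}}{|\mathbf{r} - \mathbf{r}'|} = ik\sum_\ell (2\ell + 1)\, P_\ell(w)\, h_\ell(kr)\, j_\ell(kr') \tag{5.5.12}$$


$$= 4\pi i k\sum_{\ell, m} Y_{\ell m}(\Omega)\, Y^*_{\ell m}(\Omega')\, h_\ell(kr)\, j_\ell(kr'). \tag{5.5.13}$$

Also, by taking the limit $r \to \infty$ on both sides of (5.5.12) (and then 
reversing the sign of $\mathbf{k}$), we find 


$$e^{i\mathbf{k}\cdot\mathbf{r}'} = \sum_\ell (2\ell + 1)\, i^\ell\, P_\ell(\hat{\mathbf{k}}\cdot\hat{\mathbf{r}}')\, j_\ell(kr'). \tag{5.5.14}$$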


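The functions $h_\ell$ and $j_\ell$ are interesting and quite easy to study using 
methods similar to the standard quantum mechanical treatment of the 
harmonic oscillator and angular momentum. Recall the definitions: 


$$\left[-\frac{d^2}{dr^2} - \frac{2}{r}\frac{d}{dr} + \frac{\ell(\ell + 1)}{r^2}\right]\begin{Bmatrix} h_\ell \\ j_\ell \end{Bmatrix} = k^2\begin{Bmatrix} h_\ell \\ j_\ell \end{Bmatrix} \tag{5.5.15}$$

and, as $kr \to \infty$, 

$$h_\ell \to \frac{1}{i^{\ell+1}}\,\frac{e^{ikr}}{kr}, \qquad j_\ell \to \frac{\sin\!\left(kr - \frac{\ell\pi}{2}\right)}{kr}. \tag{5.5.16}$$


The more convenient functions are 

$$u_\ell(kr) = kr\, j_\ell(kr) \quad\text{and}\quad w_\ell(kr) = kr\, h_\ell(kr). \tag{5.5.17}$$


Note that this notation ($u_\ell$ and $w_\ell$) is not standard. 
$u_\ell(x)$ and $w_\ell(x)$ satisfy the equation 


$$\left[-\frac{d^2}{dx^2} + \frac{\ell(\ell + 1)}{x^2}\right]\begin{Bmatrix} u_\ell \\ w_\ell \end{Bmatrix} = \begin{Bmatrix} u_\ell \\ w_\ell \end{Bmatrix}, \tag{5.5.18}$$


the boundary condition $u_\ell(0) = 0$, $w_\ell(x \to \infty) \sim e^{ix}$, and the normalization 
determined by 


$$u_\ell \to \sin\!\left(x - \frac{\ell\pi}{2}\right), \qquad w_\ell \to \frac{e^{ix}}{i^{\ell+1}}$$


as $x \to \infty$. Here, $x = kr$. 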
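We solve (5.5.18) recursively by factoring. Call the operator 


$$-\frac{d^2}{dx^2} + \frac{\ell(\ell + 1)}{x^2} = H_\ell,$$

and define 

$$A^-_\ell = \frac{d}{dx} + \frac{\ell}{x} \quad\text{and}\quad A^+_\ell = -\frac{d}{dx} + \frac{\ell}{x}.$$

Then 

$$A^+_\ell A^-_\ell = \left(-\frac{d}{dx} + \frac{\ell}{x}\right)\left(\frac{d}{dx} + \frac{\ell}{x}\right) = H_\ell \tag{5.5.19}$$

and 


$$A^-_\ell A^+_\ell = H_{\ell-1} \tag{5.5.20}$$

so that, if $H_\ell\psi_\ell = \psi_\ell$, thus satisfying (5.5.18), we have 


$$A^+_\ell A^-_\ell\,\psi_\ell = \psi_\ell$$

and 

$$A^-_\ell A^+_\ell A^-_\ell\,\psi_\ell = A^-_\ell\,\psi_\ell$$

and hence 


$$H_{\ell-1}\,(A^-_\ell\psi_\ell) = (A^-_\ell\psi_\ell) \tag{5.5.21}$$


so that $A^-_\ell$ is a lowering operator, that is, it takes $\psi_\ell$ into $\psi_{\ell-1}$. 


Similarly, $A^-_{\ell+1}A^+_{\ell+1} = H_\ell$, so that 


$$A^-_{\ell+1}A^+_{\ell+1}\,\psi_\ell = \psi_\ell$$

and 

$$A^+_{\ell+1}A^-_{\ell+1}A^+_{\ell+1}\,\psi_\ell = A^+_{\ell+1}\,\psi_\ell$$

or 


$$H_{\ell+1}\,(A^+_{\ell+1}\psi_\ell) = (A^+_{\ell+1}\psi_\ell) \tag{5.5.22}$$


so that $A^+_{\ell+1}$ is a raising operator, that is, it takes $\psi_\ell$ into $\psi_{\ell+1}$. 


As $x \to \infty$, $A^+ \to -d/dx$, $A^- \to d/dx$; acting on the asymptotic forms 
for $u_\ell$ and $w_\ell$, we see that 


$$A^+ u_\ell \to -\frac{d}{dx}\sin\!\left(x - \frac{\ell\pi}{2}\right) = -\cos\!\left(x - \frac{\ell\pi}{2}\right) = \sin\!\left(x - \frac{(\ell + 1)\pi}{2}\right)$$


so that the raising and lowering operators maintain the correct asymptotic 
limits and therefore the correct normalizations for the functions $w_\ell$ and 
$u_\ell$. 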

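We now simply construct the functions starting from the solutions for 
$\ell = 0$: 


$$w_0(x) = \frac{e^{ix}}{i} \tag{5.5.23}$$


and 


$$u_0(x) = \sin x. \tag{5.5.24}$$

Thus, 


$$w_1(x) = A^+_1 w_0(x) = \left(-\frac{d}{dx} + \frac{1}{x}\right)\frac{e^{ix}}{i} = \frac{e^{ix}}{i^2}\left(1 + \frac{i}{x}\right),$$


$$u_1(x) = A^+_1 u_0(x) = \left(-\frac{d}{dx} + \frac{1}{x}\right)\sin x = -\cos x + \frac{\sin x}{x}.$$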
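
The construction can be carried to higher $\ell$ mechanically. The sketch below is an illustration under stated assumptions rather than the author's procedure: the function names `riccati_u` and `double_factorial` are hypothetical, and the derivative needed by the raising operator is obtained from the lowering relation $A^-_{\ell+1}u_{\ell+1} = u_\ell$ instead of by symbolic differentiation. It generates $u_0, \ldots, u_{\ell_{max}}$ at a given $x$ so that the small-$x$ and large-$x$ behavior can be inspected.

```python
import math

def riccati_u(ell_max, x):
    """Return [u_0(x), ..., u_ell_max(x)] built with the raising operator.

    u_{l+1} = A^+_{l+1} u_l = -u_l' + (l+1) u_l / x, starting from u_0 = sin x.
    The derivative of u_{l+1} follows from the lowering operator:
    A^-_{l+1} u_{l+1} = u_l  =>  u_{l+1}' = u_l - (l+1) u_{l+1} / x.
    """
    u, du = math.sin(x), math.cos(x)
    values = [u]
    for ell in range(ell_max):
        u_next = -du + (ell + 1) * u / x      # raise l -> l+1
        du = u - (ell + 1) * u_next / x       # derivative of the raised function
        u = u_next
        values.append(u)
    return values

def double_factorial(n):
    """n!! = n (n-2) (n-4) ..., with the convention n!! = 1 for n <= 1."""
    result = 1
    while n > 1:
        result *= n
        n -= 2
    return result

# Small x: u_2(x) should be close to x**3 / 5!! = x**3 / 15.
small = riccati_u(2, 0.1)[2] / (0.1 ** 3 / double_factorial(5))
# Large x: u_2(x) should approach sin(x - pi), up to corrections of order 1/x.
large = riccati_u(2, 200.0)[2] - math.sin(200.0 - math.pi)
```

Upward recursion of this kind loses accuracy once $\ell$ greatly exceeds $x$, so the sketch is meant for small $\ell$ only.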


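We note general properties of $w_\ell$: $w_\ell = (e^{ix}/i^{\ell+1})\,(1 +$ ascending powers 
of $1/ix$, the last being $1/x^\ell$). $u_\ell = \sin x$ and $\cos x$ times ascending powers 
of $1/x$, down to $1/x^\ell$, with $u_\ell$ odd or even in $x$ according to $\ell$ even or odd, 
and going like $x^{\ell+1}$ as $x \to 0$. This last result can be proved by induction, 
using $A^+_{\ell+1}$ to raise $\ell$; however, it is also evident from (5.5.9) using the 
orthogonality properties of the Legendre polynomials. 

The coefficient of $x^{\ell+1}$ for small $x$ in $u_\ell$ can be calculated by induction: 


$$A^-_\ell\, x^{\ell+1} = \left(\frac{d}{dx} + \frac{\ell}{x}\right)x^{\ell+1} = (2\ell + 1)\, x^\ell$$


so that if $u_\ell \to C_\ell\, x^{\ell+1}$ as $x \to 0$, $u_{\ell-1} \to C_\ell\,(2\ell + 1)\, x^\ell$ as $x \to 0$, so 
$C_{\ell-1}/(2\ell + 1) = C_\ell$ that, together with $C_0 = 1$, yields 


$$C_\ell = \frac{1}{(2\ell + 1)!!}$$

and 

$$u_\ell \to \frac{x^{\ell+1}}{(2\ell + 1)!!}, \quad x \to 0. \tag{5.5.25}$$


Here, $(2\ell + 1)!! = 1\cdot 3\cdot 5\cdots(2\ell + 1)$. 
We can also calculate the coefficient of $1/x^\ell$ for small $x$ in $w_\ell$ by using 
the raising operator $A^+_{\ell+1}$ on $w_\ell$. We find, with $w_\ell \to B_\ell\, e^{ix}/x^\ell$ as $x \to 0$, 


$$B_{\ell+1}\,\frac{e^{ix}}{x^{\ell+1}} = \frac{(2\ell + 1)}{x^{\ell+1}}\, e^{ix}\, B_\ell$$

or 

$$B_{\ell+1} = (2\ell + 1)\, B_\ell.$$

Thus, 

$$B_\ell = \frac{1}{i}\cdot 1\cdot 3\cdot 5\cdots(2\ell - 1) = \frac{(2\ell - 1)!!}{i}$$

and 


$$w_\ell \to \frac{(2\ell - 1)!!}{i}\,\frac{e^{ix}}{x^\ell}, \quad x \to 0. \tag{5.5.26}$$


Of course, $(2\ell - 1)!! = 1$ for $\ell = 0$. 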


5.6. PARTIAL WAVES AND PHASE SHIFTS 


Given a source $S$ in the form $U\psi$, the resultant Schrödinger type equation 
is generally hard to solve for the scattering amplitude. There are two 
exceptions. The first is valid when the interaction is weak, in which case 
one can apply perturbative methods to the problem. We shall see some 
examples of this in Section 5.9. 

The second requires that the interaction possess spherical symmetry. 
In that case, one can use spherical harmonic expansions to reduce the 
three-dimensional problem to a set of one-dimensional problems—one for 
each £ value. We turn to that case now. 
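We have 

$$S(\mathbf{x}, t) = U\psi \tag{5.6.1}$$


where $U$ is a real,² spherically symmetric linear operator, which becomes 
small rapidly away from the source.³ Since $U$ is spherically symmetric, it 
is useful to expand the wave function and scattering amplitude $f$ in 
spherical harmonics. That gives, for large $r$, 


$$\psi \to e^{i\mathbf{k}\cdot\mathbf{x}} + \frac{e^{ikr}}{r}\, f(\hat{\mathbf{r}}, k) + O\!\left(\frac{1}{r^2}\right) \tag{5.6.2}$$

or 

$$\psi \to \sum_\ell (2\ell + 1)\, i^\ell\, \frac{\sin\!\left(kr - \frac{\ell\pi}{2}\right)}{kr}\, P_\ell(\hat{\mathbf{r}}\cdot\hat{\mathbf{k}}) + \frac{e^{ikr}}{r}\sum_\ell f_\ell\,(2\ell + 1)\, P_\ell(\hat{\mathbf{r}}\cdot\hat{\mathbf{k}}) + O\!\left(\frac{1}{r^2}\right) \tag{5.6.3}$$

where 

$$f(\hat{\mathbf{r}}, k) = \sum_\ell f_\ell\,(2\ell + 1)\, P_\ell(\hat{\mathbf{r}}\cdot\hat{\mathbf{k}}) \tag{5.6.4}$$


defines $f_\ell$ and where the Legendre polynomial expansion of $e^{i\mathbf{k}\cdot\mathbf{x}}$ makes 
use of (5.5.14). 
On the other hand, with 


$$\psi = \sum_\ell A_\ell\, \frac{v_\ell(r)}{r}\, P_\ell(w)\,(2\ell + 1), \tag{5.6.5}$$

as $r \to \infty$, $v_\ell$ is real and must approach 

$$v_\ell \to \sin\!\left(kr - \frac{\ell\pi}{2} + \delta_\ell\right), \tag{5.6.6}$$


thereby defining the phase shift $\delta_\ell$. The $\ell\pi/2$ is inserted to make $\delta_\ell = 0$ 
in the absence of interaction. Each partial wave function can have an 
arbitrary constant coefficient $A_\ell$ since the wave equation is homogeneous. 


²See Problem 5.2 for a discussion of scattering by a complex potential. 
³For example, $U\psi = \int U(\mathbf{x}, \mathbf{y})\,\psi(\mathbf{y})\, d\mathbf{y}$, with $U$ a real function of $|\mathbf{x}|$, $|\mathbf{y}|$ and $\mathbf{x}\cdot\mathbf{y}$. 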
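That coefficient is determined by the incoming wave and retarded 
scattered wave boundary conditions implicit in (5.6.3). We find 


$$f_\ell = \frac{e^{2i\delta_\ell} - 1}{2ik} = \frac{e^{i\delta_\ell}\sin\delta_\ell}{k} \tag{5.6.7}$$


and 


$$f(\hat{\mathbf{r}}, k) = \frac{1}{k}\sum_\ell (2\ell + 1)\, P_\ell(\cos\theta)\, e^{i\delta_\ell}\sin\delta_\ell. \tag{5.6.8}$$


The scattering amplitude $f$ satisfies the optical theorem. That is, using 
the result 


$$\int P_\ell(w)\, P_{\ell'}(w)\, d\Omega = \frac{4\pi}{2\ell + 1}\,\delta_{\ell\ell'}, \tag{5.6.9}$$


we have 

$$\int |f(\hat{\mathbf{r}}, k)|^2\, d\Omega = \frac{4\pi}{k^2}\sum_\ell (2\ell + 1)\sin^2\delta_\ell = \frac{4\pi}{k}\,\mathrm{Im}\, f(\theta = 0, k). \tag{5.6.10}$$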
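
The identity (5.6.10) is easy to confirm numerically for any set of real phase shifts. The following sketch is illustrative only: the phase shifts and wave number in the example are arbitrary made-up values, the function names are hypothetical, and the angular integral is done with Simpson's rule rather than by using orthogonality. It builds $f$ from (5.6.8) and compares $\int |f|^2 d\Omega$ with $(4\pi/k)\,\mathrm{Im}\, f(\theta = 0)$.

```python
import cmath
import math

def legendre(ell_max, w):
    """Return [P_0(w), ..., P_ell_max(w)] from the standard recurrence."""
    p = [1.0, w]
    for ell in range(1, ell_max):
        p.append(((2 * ell + 1) * w * p[ell] - ell * p[ell - 1]) / (ell + 1))
    return p[:ell_max + 1]

def amplitude(w, deltas, k):
    """Scattering amplitude (5.6.8) at w = cos(theta) for phase shifts deltas."""
    p = legendre(len(deltas) - 1, w)
    return sum((2 * ell + 1) * p[ell] * cmath.exp(1j * d) * math.sin(d)
               for ell, d in enumerate(deltas)) / k

def elastic_cross_section(deltas, k, n=2000):
    """Integrate |f|^2 over solid angle with Simpson's rule in w (n even)."""
    h = 2.0 / n
    total = 0.0
    for i in range(n + 1):
        w = -1.0 + i * h
        weight = 1 if i in (0, n) else (4 if i % 2 else 2)
        total += weight * abs(amplitude(w, deltas, k)) ** 2
    return 2 * math.pi * total * h / 3        # factor 2*pi from the azimuth

# Arbitrary illustrative phase shifts for l = 0, 1, 2 and wave number k.
deltas, k = [0.7, -0.3, 0.15], 1.3
lhs = elastic_cross_section(deltas, k)
rhs = 4 * math.pi / k * amplitude(1.0, deltas, k).imag
```

The two numbers `lhs` and `rhs` should agree to numerical precision, whatever real phase shifts are chosen.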


Referring back to (5.6.3), we see that once $f_\ell$ is known, the wave 
function in an interaction free region can be extrapolated back to the 
point where the interaction becomes significant simply by replacing 
$e^{ikr}/kr$ by $h_\ell(kr)\, i^{\ell+1}$ and $\sin(kr - \ell\pi/2)/kr$ by $j_\ell(kr)$. 

We may calculate the behavior of $\delta_\ell$ for small $k$, assuming a source 
that is strongly confined to the neighborhood of $r = 0$. The scattering 
amplitude $f$ will in scalar scattering normally depend on $\cos\theta$ through the 
dot product $\mathbf{k}_i\cdot\mathbf{k}_f$. Therefore, for small $k$ we can expand $f$ in powers of 
$\mathbf{k}_i\cdot\mathbf{k}_f = k^2\cos\theta$. This will lead to a series of the form 


$$a_0 + a_1 k^2\cos\theta + a_2 k^4\cos^2\theta + \cdots = a_0 P_0 + a_1 k^2 P_1 + a_2 k^4\left(\tfrac{2}{3}P_2 + \tfrac{1}{3}P_0\right) + \cdots$$


Clearly, the coefficient of $P_\ell$ is a power series in $k$, starting with $k^{2\ell}$. Since 


$$f_\ell = \frac{e^{i\delta_\ell}\sin\delta_\ell}{k}$$


goes like $k^{2\ell}$ for small $k$, $\delta_\ell$ must go like $k^{2\ell+1}$. 


A more general way of looking at the low wave number behavior of 
the phase shift is to introduce the logarithmic derivative $\xi$ of the wave 
function at a point $r_0$, outside of, but close to, the interaction region. The 
parameter $\xi$ is then matched to the logarithmic derivative of the wave 
function $v_\ell$ that, when $kr \gg \ell$, goes over into 


$$\sin\!\left(kr - \frac{\ell\pi}{2} + \delta_\ell\right),$$

that is, 

$$v_\ell = u_\ell(kr)\cos\delta_\ell + q_\ell(kr)\sin\delta_\ell \tag{5.6.11}$$


for $r$ outside of the interaction region. Here, we have introduced the 
function 


$$q_\ell = \frac{u_\ell - w_\ell}{i} \tag{5.6.12}$$


which goes for small $x$ like 


$$q_\ell \to \frac{(2\ell - 1)!!}{x^\ell}.$$


The logarithmic derivative $\xi$ is obtained by integrating $v_\ell$ out from 
the origin; with a short-range, energy-independent potential, $\xi$ will have 
a finite limit as $k \to 0$, obtained by integrating the equation 


$$\left[-\frac{d^2}{dr^2} + U + \frac{\ell(\ell + 1)}{r^2}\right]v_\ell = k^2 v_\ell \tag{5.6.13}$$


to the point $r_0$ where the match is to be made. 
The matching equation is [with $u'_\ell = du_\ell(kr)/dr$, $q'_\ell = dq_\ell(kr)/dr$] 


$$\xi = \frac{u'_\ell\cos\delta_\ell + q'_\ell\sin\delta_\ell}{u_\ell\cos\delta_\ell + q_\ell\sin\delta_\ell} \tag{5.6.14}$$

or 

$$\tan\delta_\ell = -\frac{u'_\ell - \xi u_\ell}{q'_\ell - \xi q_\ell}. \tag{5.6.15}$$


The small $k$ limit of $\delta_\ell$ can be calculated from (5.6.15). For $\xi$ approaching 
a finite limit as $k \to 0$ (which as we shall see is not the case in scattering 
by a dielectric), we find, using the small $x$ expansions of $u_\ell$ and $q_\ell$ 
(5.5.25-5.5.26), 


$$\delta_\ell \to (kr_0)^{2\ell+1}\,\frac{\ell + 1 - \xi r_0}{\xi r_0 + \ell}\,\frac{1}{(2\ell + 1)!!\,(2\ell - 1)!!}. \tag{5.6.16}$$
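
One case in which (5.6.16) can be compared with an exact result is the limit $\xi \to \infty$, in which the wave function vanishes at $r_0$. This impenetrable-sphere limit is an assumption introduced here for illustration and is not discussed in the text. Then (5.6.15) reduces to $\tan\delta_\ell = -u_\ell/q_\ell$ and (5.6.16) to $\delta_\ell \to -(kr_0)^{2\ell+1}/[(2\ell + 1)!!\,(2\ell - 1)!!]$. The sketch below, with hypothetical function names, uses the closed forms of $u_\ell$ and $q_\ell$ for $\ell = 0, 1$.

```python
import math

def double_factorial(n):
    """n!! with the convention n!! = 1 for n <= 1 (so (-1)!! = 1)."""
    result = 1
    while n > 1:
        result *= n
        n -= 2
    return result

def exact_delta(ell, x):
    """Phase shift from tan(delta) = -u_l/q_l at x = k*r0 (xi -> infinity)."""
    if ell == 0:
        u, q = math.sin(x), math.cos(x)
    elif ell == 1:
        u = math.sin(x) / x - math.cos(x)      # u_1 from the raising operator
        q = math.cos(x) / x + math.sin(x)      # q_1, the -Im part of w_1
    else:
        raise ValueError("only ell = 0 and ell = 1 are coded in this sketch")
    return math.atan2(-u, q)

def low_k_delta(ell, x):
    """Small-k limit (5.6.16) with xi -> infinity."""
    return -x ** (2 * ell + 1) / (double_factorial(2 * ell + 1)
                                  * double_factorial(2 * ell - 1))

# At k*r0 = 0.05 the two expressions agree closely for both partial waves.
x = 0.05
pairs = [(exact_delta(ell, x), low_k_delta(ell, x)) for ell in (0, 1)]
```

For $\ell = 0$ the agreement is exact ($\delta_0 = -kr_0$); for $\ell = 1$ the relative difference is of order $(kr_0)^2$.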


Equation (5.6.16) is particularly simple for $\ell = 0$ (which is present for 
a scalar field, but not for the electromagnetic field). It becomes, for 
$\ell = 0$, 


$$\delta_0 \to kr_0\,\frac{1 - \xi r_0}{\xi r_0} \tag{5.6.17}$$

$$= -k\left(r_0 - \frac{1}{\xi}\right). \tag{5.6.18}$$


It is useful, following a method of Fermi, to parametrize $v_0(r)$ for 
$k = 0$ in the neighborhood of $r_0$: 


$$v_0 = b(r + a), \qquad v'_0 = b \tag{5.6.19}$$

and 

$$\frac{1}{\xi} = \frac{v_0}{v'_0} = r_0 + a \tag{5.6.20}$$

so that 

$$\frac{\delta_0}{k} \cong a. \tag{5.6.21}$$


$a$ is called the scattering length. The differential cross section at $k = 0$ is 
$|a|^2$. Note that $-a$ is the value of $r$ at which the zero energy wave function, 
extrapolated from its value and slope at $r = r_0$, vanishes. 

We can illustrate with three cases, all for $\ell = 0$. The equation for $v_0$ 
is, at $k = 0$, 


$$-\frac{d^2 v_0}{dr^2} + U v_0 = 0.$$


If $U$ is negative (attractive in quantum mechanics) and small, $v_0$ will look 
like Figure 5.3(a), will vanish at negative values of $r$, and $a$ will be positive. 
If $U$ is not weak, as in Figure 5.3(b), it can turn the curve over. This 
corresponds in quantum theory to the existence of a bound state and in 
classical wave theory to the existence of a localized solution of the wave 
equation with $\omega < \mu$. In this case, the scattering length will be negative. 
The third possibility is $U$ positive (in quantum mechanics a repulsion), as 
in Figure 5.3(c). In that case, the scattering length is again negative. 


Figure 5.3. 


5.7. ELECTROMAGNETIC FIELD SCATTERING 


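As with the scalar field, we construct the vector field as a superposition 
of a positive frequency field and its complex conjugate. That is, for the 
incoming field we have 


$$\mathbf{E}^0 = \mathbf{E}^0_+ + \mathbf{E}^0_- = \mathbf{E}^0_+ + \mathbf{E}^{0*}_+ \tag{5.7.1}$$

where 

$$\mathbf{E}^0_+ = \hat{\mathbf{e}}^0\, e^{i(\mathbf{k}\cdot\mathbf{x} - \omega t)}, \qquad \hat{\mathbf{e}}^{0*}\cdot\hat{\mathbf{e}}^0 = 1, \qquad \mathbf{k}\cdot\hat{\mathbf{e}}^0 = 0 \tag{5.7.2}$$

and 

$$\mathbf{B}^0 = \mathbf{B}^0_+ + \mathbf{B}^0_-, \tag{5.7.3}$$

with 

$$i\omega\,\mathbf{B}^0_+ = i\mathbf{k}\times\mathbf{E}^0_+. \tag{5.7.4}$$


The outgoing scattered wave for the $\mathbf{E}$ field will be 


$$\mathbf{E}^s_+ = \frac{e^{i(kr - \omega t)}}{r}\,\mathbf{f} + O\!\left(\frac{1}{r^2}\right) \tag{5.7.5}$$


and 


$$i\omega\,\mathbf{B}^s_+ = ik\,\hat{\mathbf{r}}\times\mathbf{E}^s_+ + O\!\left(\frac{1}{r^2}\right). \tag{5.7.6}$$


Of course, $\mathbf{E}^0$ and $\mathbf{f}$ will be transverse. That is, 

$$\mathbf{k}\cdot\mathbf{E}^0 = \hat{\mathbf{r}}\cdot\mathbf{f} = 0. \tag{5.7.7}$$


The incoming time-averaged Poynting flux will be 


$$\bar{\mathbf{P}}_0 = \frac{1}{2\pi}\,\mathrm{Re}\;\mathbf{E}^{0*}_+\times\mathbf{B}^0_+ = \frac{1}{2\pi}\,\mathrm{Re}\;\hat{\mathbf{e}}^{0*}\times\left(\frac{\mathbf{k}}{\omega}\times\hat{\mathbf{e}}^0\right)$$

or 

$$\bar{\mathbf{P}}_0 = \frac{\mathbf{k}}{2\pi\omega}\,\hat{\mathbf{e}}^{0*}\cdot\hat{\mathbf{e}}^0 = \frac{\hat{\mathbf{k}}}{2\pi}. \tag{5.7.8}$$


The time-averaged scattered flux per unit area will be 


$$\bar{\mathbf{P}}_s = \frac{1}{2\pi}\,\mathrm{Re}\;\frac{\mathbf{f}^*}{r}\times\frac{(\hat{\mathbf{r}}\times\mathbf{f})}{r} + \cdots \tag{5.7.9}$$

$$= \frac{1}{2\pi}\,\frac{\hat{\mathbf{r}}}{r^2}\;\mathbf{f}^*\cdot\mathbf{f} + \cdots \tag{5.7.10}$$


giving a scattered flux per unit solid angle 


$$r^2\,\hat{\mathbf{r}}\cdot\bar{\mathbf{P}}_s = \frac{1}{2\pi}\,\mathbf{f}^*\cdot\mathbf{f}$$


and a differential scattering cross section⁴ 


$$\frac{d\sigma}{d\Omega} = \mathbf{f}^*\cdot\mathbf{f}. \tag{5.7.11}$$


⁴The wave packet discussion given earlier for scalar scattering evidently goes through 
equally well for the vector field we are considering now, with the final result being the 
justification of (5.7.11). 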
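The polarization of the scattered wave is described by the vector
scattering amplitude $\mathbf{f}(\mathbf{k}, \hat{\mathbf{r}}, \hat{\mathbf{e}}^0)$, which is a linear function of $\hat{\mathbf{e}}^0$. The function
$\mathbf{f}$ can be resolved into any complete pair of polarization vectors $\hat{\mathbf{e}}^\lambda$ ($\lambda = 1, 2$), where
we would normally choose $\hat{\mathbf{e}}^{\lambda *}\cdot\hat{\mathbf{e}}^{\lambda'} = \delta_{\lambda\lambda'}$. Thus,

$$\mathbf{f} = \sum_{\lambda=1}^{2}\hat{\mathbf{e}}^\lambda\,(\hat{\mathbf{e}}^{\lambda *}\cdot\mathbf{f}) \qquad (5.7.12)$$

and $\hat{\mathbf{e}}^{\lambda *}\cdot\mathbf{f}$ gives the scattering that would be measured by a detector
detecting only the polarization state $\lambda$.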
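The resolution (5.7.12) can be illustrated numerically. The following is a minimal sketch, not taken from the text: the sample amplitude `f`, the choice of the scattering direction along z, and the helper names `cdot`, `components`, and `rebuild` are all assumptions made for illustration. It checks that the partial cross sections $|\hat{\mathbf{e}}^{\lambda *}\cdot\mathbf{f}|^2$ add up to $\mathbf{f}^*\cdot\mathbf{f}$ for both a linear and a circular polarization basis.

```python
import math

def cdot(a, b):
    """Hermitian product a* . b of two 3-component (complex) vectors."""
    return sum(complex(x).conjugate() * y for x, y in zip(a, b))

def components(f, basis):
    """Amplitudes e^{lambda*} . f seen by detectors of each polarization."""
    return [cdot(e, f) for e in basis]

def rebuild(f, basis):
    """Sum over lambda of e^lambda (e^{lambda*} . f), as in (5.7.12)."""
    comps = components(f, basis)
    return tuple(sum(c * e[i] for c, e in zip(comps, basis)) for i in range(3))

# Hypothetical sample amplitude; r_hat is taken along z, so f is transverse.
f = (0.3 + 0.1j, -0.2 + 0.4j, 0.0)

s = 1.0 / math.sqrt(2.0)
linear = [(1, 0, 0), (0, 1, 0)]
circular = [(s, 1j * s, 0), (s, -1j * s, 0)]

for name, basis in (("linear", linear), ("circular", circular)):
    partial = [abs(c) ** 2 for c in components(f, basis)]
    print(name, partial, sum(partial), cdot(f, f).real)
```

Each basis splits the same total $\mathbf{f}^*\cdot\mathbf{f}$ differently between its two detectors, which is the content of the statement about polarization-selective detection.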


5.8. THE OPTICAL THEOREM FOR LIGHT 


As in our discussion of the scalar field, we make use of the surface integral 
of the Poynting vector, which satisfies the equation 


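$$\oint_S \mathbf{P}\cdot d\mathbf{S} = -\frac{d}{dt}\int_V d\mathbf{r}\,u \qquad (5.8.1)$$
so that for a monochromatic wave, the time average
$$\oint_S \bar{\mathbf{P}}\cdot d\mathbf{S} = \frac{1}{2\pi}\,{\rm Re}\oint_S \mathbf{E}_+^*\times\mathbf{B}_+\cdot d\mathbf{S} = 0, \qquad (5.8.2)$$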


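where we will integrate over a distant sphere. For $r\to\infty$, we know

$$\mathbf{E}_+ = \hat{\mathbf{e}}^0 e^{i\mathbf{k}\cdot\mathbf{r}} + \mathbf{f}\,\frac{e^{ikr}}{r} + O\!\left(\frac{1}{r^2}\right) \qquad (5.8.3)$$
and
$$\mathbf{B}_+ = \hat{\mathbf{k}}\times\hat{\mathbf{e}}^0 e^{i\mathbf{k}\cdot\mathbf{r}} + \hat{\mathbf{r}}\times\mathbf{f}\,\frac{e^{ikr}}{r} + O\!\left(\frac{1}{r^2}\right). \qquad (5.8.4)$$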


Equations (5.8.2), (5.8.3), and (5.8.4) then tell us that 


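$$0 = {\rm Re}\int r^2\,d\Omega\;\hat{\mathbf{r}}\cdot\left(\hat{\mathbf{e}}^{0*}e^{-i\mathbf{k}\cdot\mathbf{r}} + \mathbf{f}^*\,\frac{e^{-ikr}}{r}\right)\times\left(\hat{\mathbf{k}}\times\hat{\mathbf{e}}^0 e^{i\mathbf{k}\cdot\mathbf{r}} + \hat{\mathbf{r}}\times\mathbf{f}\,\frac{e^{ikr}}{r}\right). \qquad (5.8.5)$$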


We proceed as in Section 5.4, noting that 
$$\int_{-1}^{1} dw\,e^{-ikrw}F(w) = -\frac{1}{ikr}\left(e^{-ikr}F(1) - e^{ikr}F(-1)\right) + O\!\left(\frac{1}{r^2}\right). \qquad (5.8.6)$$
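We have, with $w = \hat{\mathbf{r}}\cdot\hat{\mathbf{k}}$,

$${\rm Re}\left\{ r\int d\Omega\;\hat{\mathbf{r}}\cdot\left[\mathbf{f}^*\times(\hat{\mathbf{k}}\times\hat{\mathbf{e}}^0)\,e^{-ikr(1-w)} + \hat{\mathbf{e}}^{0*}\times(\hat{\mathbf{r}}\times\mathbf{f})\,e^{ikr(1-w)}\right] + \int d\Omega\;\mathbf{f}^*\cdot\mathbf{f}\right\}$$
$$= {\rm Re}\left\{\frac{2\pi}{ik}\left[\hat{\mathbf{r}}\cdot\mathbf{f}^*\times(\hat{\mathbf{k}}\times\hat{\mathbf{e}}^0)\,e^{-ikr(1-w)} - \hat{\mathbf{r}}\cdot\hat{\mathbf{e}}^{0*}\times(\hat{\mathbf{r}}\times\mathbf{f})\,e^{ikr(1-w)}\right]_{w=-1}^{w=1} + \int d\Omega\;\mathbf{f}^*\cdot\mathbf{f}\right\} = 0. \qquad (5.8.7)$$

Combining terms and noting that $\hat{\mathbf{r}}\cdot\hat{\mathbf{k}} = \pm 1$ at $w = \pm 1$, we obtain

$${\rm Re}\left[\frac{2\pi}{ik}\left(\mathbf{f}^*\cdot\hat{\mathbf{e}}^0 - \hat{\mathbf{e}}^{0*}\cdot\mathbf{f}\right)_{w=1} + \frac{2\pi}{ik}\left(\mathbf{f}^*\cdot\hat{\mathbf{e}}^0\,e^{-2ikr} + \hat{\mathbf{e}}^{0*}\cdot\mathbf{f}\,e^{2ikr}\right)_{w=-1} + \int d\Omega\;\mathbf{f}^*\cdot\mathbf{f}\right] = 0 \qquad (5.8.8)$$

(the $w = -1$ term is purely imaginary and drops out of the real part)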


or finally, 


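$${\rm Im}\;\hat{\mathbf{e}}^{0*}\cdot\mathbf{f}(\theta = 0) = \frac{k}{4\pi}\,\sigma_{el} \qquad (5.8.9)$$

where, as before, $\sigma_{el}$ is the total elastic cross section, and $\hat{\mathbf{e}}^{0*}\cdot\mathbf{f}$ is the
forward, polarization-preserving scattering amplitude.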


5.9. PERTURBATION THEORY OF SCATTERING 


We consider a situation where the source-field coupling is sufficiently weak 
so that we may calculate the charge and current distribution of the source 
induced by the field without taking into account the reaction of the source 
on itself. We illustrate with two examples: scattering by a damped oscil- 
lator and scattering by a dielectric with a dielectric constant near 1. 
5.9.1. Scattering by a Damped Oscillator and Radiation Reaction


We write the equation of the charged oscillator in a weak electromagnetic 
field: 


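$$m\ddot{\mathbf{x}} + R\dot{\mathbf{x}} + k\mathbf{x} = q\left(\mathbf{E}(\mathbf{x},t) + \dot{\mathbf{x}}\times\mathbf{B}(\mathbf{x},t)\right). \qquad (5.9.1)$$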


Here, R is a damping constant that we will adjust to give overall energy 
conservation via the optical theorem. m and k are, respectively, the mass 
and force constant of the oscillator. 

The displacement $\mathbf{x}$ and velocity $\dot{\mathbf{x}}$ in steady-state motion will be
linear in the field strength; therefore, since the field is weak, we may
neglect the $\mathbf{x}$ and $\dot{\mathbf{x}}$ dependence on the right-hand side of (5.9.1). In this
linear approximation, the scattering is independent of the strength of the
field, which we normalize to unity. The incoming field $\mathbf{E}_0$ is then


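$$\mathbf{E}_0 = \hat{\mathbf{e}}^0 e^{i(\mathbf{k}\cdot\mathbf{x}-\omega t)} \qquad (5.9.2)$$

and the magnetic field is

$$\mathbf{B}_0 = \hat{\mathbf{k}}\times\mathbf{E}_0.$$

The incoming average energy flux per unit area is, with $\hat{\mathbf{e}}^{0*}\cdot\hat{\mathbf{e}}^0 = 1$,

$$\bar{P}^{\rm in} = \frac{1}{2\pi}. \qquad (5.9.3)$$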


The steady-state motion of the oscillator is given (in our linear ap- 
proximation) by 


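$$\mathbf{x} = \mathbf{x}_0 e^{-i\omega t} \qquad (5.9.4)$$
with
$$\mathbf{x}_0 = \frac{q\hat{\mathbf{e}}^0/m}{\omega_0^2 - \omega^2 - i\omega R/m} \qquad (5.9.5)$$

and $\omega_0^2 = k/m$.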
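The field radiated by the oscillator is then

$$\mathbf{E}^{\rm rad} = -\frac{q\,\ddot{\mathbf{x}}_T(t_r)}{r}, \qquad (5.9.6)$$

where $t_r = t - r$ is the retarded time and the subscript $T$ denotes the component transverse to $\hat{\mathbf{r}}$, and the scattering amplitude $\mathbf{f}$ is given by

$$\mathbf{f} = \frac{q^2\omega^2}{m}\,\frac{\hat{\mathbf{e}}^0_T}{\omega_0^2 - \omega^2 - i\omega R/m}, \qquad (5.9.7)$$

with the elastic cross section

$$\sigma_{el} = \frac{8\pi}{3}\,\frac{q^4}{m^2}\,\frac{\omega^4}{\left|\omega_0^2 - \omega^2 - i\omega R/m\right|^2}. \qquad (5.9.8)$$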


Before discussing the result (5.9.8), we use the optical theorem to 
determine R(w). From (5.9.7), 


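$${\rm Im}\;\hat{\mathbf{e}}^{0*}\cdot\mathbf{f}(\theta = 0) = \frac{q^2\omega^2}{m}\,\frac{\omega R/m}{\left|\omega_0^2 - \omega^2 - iR\omega/m\right|^2}. \qquad (5.9.9)$$
Since the optical theorem requires
$${\rm Im}\;\hat{\mathbf{e}}^{0*}\cdot\mathbf{f}(\theta = 0) = \frac{\omega}{4\pi}\,\sigma_{el},$$
we find
$$\frac{\omega}{4\pi}\,\frac{8\pi}{3}\,\frac{q^4\omega^4}{m^2}\,\frac{1}{\left|\omega_0^2 - \omega^2 - iR\omega/m\right|^2} = \frac{q^2\omega^2}{m}\,\frac{\omega R/m}{\left|\omega_0^2 - \omega^2 - iR\omega/m\right|^2}$$
or

$$R = \frac{2}{3}\,q^2\omega^2 \qquad (5.9.10)$$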
so that the damping term in (5.9.1) could be written as $R\dot{\mathbf{x}} = -\frac{2}{3}q^2\dddot{\mathbf{x}}$,
which is a more usual form for the so-called radiation reaction
force, $\mathbf{f}_r = \frac{2}{3}q^2\dddot{\mathbf{x}}$. We see (as in Problem 4.8) that inserting the force $\mathbf{f}_r$
into the equation of motion gives a correct overall energy balance for this
situation. In general, for confined motion this will be the case. It is clear,
however, that in the absence of a confining potential, the force $\mathbf{f}_r$ gives
nonsensical results. Thus, with the force constant $k$ and the incident field
$\mathbf{E}_0$ set to zero, (5.9.1) would be

$$m\ddot{\mathbf{x}} = \frac{2}{3}\,q^2\,\dddot{\mathbf{x}} \qquad (5.9.11)$$
which has the general solution
$$\mathbf{x} = \mathbf{x}_0 - \frac{\mathbf{a}_0}{\gamma^2} + \left(\mathbf{v}_0 - \frac{\mathbf{a}_0}{\gamma}\right)t + \frac{\mathbf{a}_0}{\gamma^2}\,e^{\gamma t}, \qquad \gamma = \frac{3m}{2q^2}. \qquad (5.9.12)$$


Thus, unless $\mathbf{a}_0 = 0$, the motion explodes exponentially, and the formula
makes no sense. Note that this problem is not resolved by a harmonic 
binding force. 
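The energy balance that fixes $R$ in (5.9.10) can be checked numerically. The sketch below is an illustration under stated assumptions, not the author's calculation: it takes the forward amplitude and elastic cross section in the forms $\hat{\mathbf{e}}^{0*}\cdot\mathbf{f}(0) = (q^2\omega^2/m)/(\omega_0^2-\omega^2-i\omega R/m)$ and $\sigma_{el} = (8\pi/3)(q^4/m^2)\,\omega^4/|\omega_0^2-\omega^2-i\omega R/m|^2$ (Gaussian units, $c = 1$), and the parameter values are arbitrary. With $R = \frac{2}{3}q^2\omega^2$ the optical-theorem residual should vanish at every frequency.

```python
import math

def radiation_damping(omega, q):
    """R(omega) = (2/3) q^2 omega^2, the value required by (5.9.10)."""
    return (2.0 / 3.0) * q ** 2 * omega ** 2

def denominator(omega, omega0, q, m):
    """Damped resonance denominator omega0^2 - omega^2 - i omega R / m."""
    R = radiation_damping(omega, q)
    return complex(omega0 ** 2 - omega ** 2, -omega * R / m)

def forward_amplitude(omega, omega0, q, m):
    """e0* . f at theta = 0 (e0 is already transverse to k there)."""
    return (q ** 2 * omega ** 2 / m) / denominator(omega, omega0, q, m)

def elastic_cross_section(omega, omega0, q, m):
    """Total elastic cross section of the driven oscillator."""
    d = denominator(omega, omega0, q, m)
    return (8.0 * math.pi / 3.0) * (q ** 4 / m ** 2) * omega ** 4 / abs(d) ** 2

def optical_theorem_residual(omega, omega0=1.0, q=0.1, m=1.0):
    """Im e0*.f(0) - (omega / 4 pi) sigma_el; zero if energy is conserved."""
    lhs = forward_amplitude(omega, omega0, q, m).imag
    rhs = omega / (4.0 * math.pi) * elastic_cross_section(omega, omega0, q, m)
    return lhs - rhs

for w in (0.2, 1.0, 3.0):
    print(w, optical_theorem_residual(w))
```

Replacing `radiation_damping` by any other function of frequency makes the residual nonzero, which is the sense in which the optical theorem determines $R(\omega)$.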
We return to (5.9.8) for the cross section. We see that the energy
denominator $1/(\omega_0^2 - \omega^2)$ has been damped by the imaginary term
$-\frac{2}{3}i(\omega^3/m)q^2$. Note that the scattering amplitude $\mathbf{f}$ at the resonance $\omega = \omega_0$
takes on the imaginary value

$$\mathbf{f} = \frac{3i}{2\omega_0}\,\hat{\mathbf{e}}^0_T,$$

independent of $q$ and $m$. This is a characteristic of resonant behavior: The
resonant scattering amplitude is given by the wavelength multiplied by a
kinematically determined constant (here, $|\mathbf{f}| = 3\lambda/4\pi$ in the forward direction).

The damping constant determines the width of the resonant curve. 
The cross section as a function of w is 
$$\sigma_{el} = \frac{C\omega^4}{(\omega_0^2 - \omega^2)^2 + \omega^2R^2(\omega)/m^2}$$

with $C$ a constant. For small $R$, the maximum is at $\omega = \omega_0$:

$$\sigma_{\max} = \frac{C\omega_0^2m^2}{R^2(\omega_0)}. \qquad (5.9.13)$$

The cross section takes on half this value at

$$\omega_0^2 - \omega^2 = \pm\frac{\omega_0R}{m} \quad\text{or}\quad \omega_0 - \omega = \pm\frac{R}{2m};$$

thus, the full width at half maximum is $\Gamma = R/m$.
On the other hand, the possible decay constants of the isolated oscil- 
lator are given by the roots of the equation 


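$$\omega_0^2 - \omega^2 - \frac{i\omega R(\omega)}{m} = 0 \qquad (5.9.14)$$
or
$$\omega = \pm\omega_0 - \frac{iR(\omega_0)}{2m},$$

giving a time dependence to the oscillation amplitude

$$\mathbf{x} = \mathbf{x}(t = 0)\,e^{\mp i\omega_0t}\,e^{-Rt/2m}$$
and an energy decay given by $e^{-\Gamma t} = e^{-Rt/m}$. Thus, the decay constant
is, in fact, equal to the width of the resonance.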
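The statement that the resonance width equals the decay constant $R/m$ can be spot-checked numerically. This is a minimal sketch assuming a constant damping $\gamma = R/m$ and the line shape $\sigma \propto \omega^4/[(\omega_0^2-\omega^2)^2 + \omega^2\gamma^2]$; the function names and the sample values $\omega_0 = 1$, $\gamma = 0.01$ are illustrative only. It locates the two frequencies where the cross section falls to half its value at $\omega_0$ by bisection.

```python
def cross_section(omega, omega0, gamma, C=1.0):
    """Line shape C w^4 / ((w0^2 - w^2)^2 + w^2 gamma^2), with gamma = R/m."""
    return C * omega ** 4 / ((omega0 ** 2 - omega ** 2) ** 2 + (omega * gamma) ** 2)

def bisect(func, lo, hi, tol=1e-14):
    """Root of func on [lo, hi]; assumes func changes sign on the interval."""
    f_lo = func(lo)
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        f_mid = func(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)

def full_width_half_max(omega0, gamma):
    """Distance between the two half-maximum points around omega0."""
    half = 0.5 * cross_section(omega0, omega0, gamma)

    def excess(w):
        return cross_section(w, omega0, gamma) - half

    lower = bisect(excess, omega0 - 5.0 * gamma, omega0)
    upper = bisect(excess, omega0, omega0 + 5.0 * gamma)
    return upper - lower

print(full_width_half_max(1.0, 0.01))  # close to gamma = 0.01 for a narrow line
```

The agreement improves as $\gamma/\omega_0$ decreases, consistent with the "small $R$" qualification in the text.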

There is unfortunately a third unwanted root of (5.9.14) produced at
high $\omega$ by taking the $\omega^2$ dependence of $R$ into account; this root is closely
related to the unphysical runaway solutions of the free particle equation
found earlier. Note that this root is in the upper half $\omega$ plane and, hence,
produces acausal behavior in the scattering. We evade all these problems
(without justification) by taking $R(\omega)$ to be a constant $R = R(\omega_0)$. Furthermore,
the energy balance now only works near the resonance. A justification
of this procedure cannot be given within the framework of classical
field theory. The contradiction between energy conservation and causality
is a genuine difficulty of classical electromagnetic theory describing the
interaction of electromagnetic fields with point particles. This problem (the
unwanted root) does not appear in relativistic quantum electrodynamics;
however, other new problems do. For further discussion of this problem,
see Section 7.5.

We finally note the low-frequency limit of (5.9.7) for the scattering 
amplitude by a free particle, 


$$\mathbf{f} = -\frac{q^2}{m}\,\hat{\mathbf{e}}^0_T, \qquad (5.9.15)$$

and of (5.9.8) for the elastic cross section,

$$\sigma_{el} = \frac{8\pi}{3}\left(\frac{q^2}{m}\right)^2. \qquad (5.9.16)$$

The expression (5.9.16) is called the Thomson cross section. This result
was first used by Thomson to measure the number of electrons in carbon
by X-ray scattering.

Note also that the scattering amplitude of a bound electron goes to
zero for small $\omega$ like $\omega^2$, accounting for the dominance of short wavelengths
in the scattering of visible light by air molecules and therefore the
blue color of the sky.
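Both low-frequency limits can be read off numerically from the oscillator cross section. The sketch below assumes the form $\sigma_{el} = (8\pi/3)(q^4/m^2)\,\omega^4/|\omega_0^2-\omega^2-i\omega R/m|^2$ with $R = \frac{2}{3}q^2\omega^2$ (Gaussian units, $c = 1$); the unit values $q = m = 1$ are arbitrary illustrative choices. For a free charge ($\omega_0 = 0$) the result should approach the Thomson value (5.9.16), and for a bound charge ($\omega \ll \omega_0$) doubling the frequency should multiply the cross section by $2^4 = 16$.

```python
import math

def elastic_cross_section(omega, omega0, q=1.0, m=1.0):
    """Oscillator cross section with radiation damping R = (2/3) q^2 omega^2."""
    R = (2.0 / 3.0) * q ** 2 * omega ** 2
    d = complex(omega0 ** 2 - omega ** 2, -omega * R / m)
    return (8.0 * math.pi / 3.0) * (q ** 4 / m ** 2) * omega ** 4 / abs(d) ** 2

def thomson_cross_section(q=1.0, m=1.0):
    """Free-particle low-frequency limit, (8 pi / 3) (q^2 / m)^2."""
    return (8.0 * math.pi / 3.0) * (q ** 2 / m) ** 2

omega = 1e-3
free_ratio = elastic_cross_section(omega, 0.0) / thomson_cross_section()
bound_ratio = elastic_cross_section(2 * omega, 1.0) / elastic_cross_section(omega, 1.0)
print(free_ratio, bound_ratio)
```

The $\omega^4$ growth of the bound-charge cross section is the amplitude's $\omega^2$ behavior squared.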


5.9.2. Scattering by a Dielectric with a Dielectric Constant Near 1


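Our second example is scattering by a dielectric with permeability $\mu = 1$
and a dielectric constant $\epsilon$ near 1: $\epsilon - 1 = 4\pi\chi \ll 1$.
As always, we return to Maxwell for guidance:

$$\mathbf{B} = \frac{\nabla\times\mathbf{E}}{i\omega} \quad\text{and}\quad \nabla\times\mathbf{B} = -i\omega\mathbf{D} = -i\omega\epsilon\mathbf{E}$$

giving a single equation for $\mathbf{E}$:

$$\nabla\times\left(\frac{\nabla\times\mathbf{E}}{i\omega}\right) = -i\omega\epsilon\mathbf{E} \qquad (5.9.17)$$

or

$$\nabla\times(\nabla\times\mathbf{E}) = \omega^2\epsilon\mathbf{E}.$$
We expand about $\mathbf{E} = \mathbf{E}_0$, the incident field, and $\epsilon = 1$:

$$\nabla\times(\nabla\times(\mathbf{E}_0 + \mathbf{E}_1)) = \omega^2(1 + (\epsilon - 1))(\mathbf{E}_0 + \mathbf{E}_1)$$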




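where $\nabla\cdot\mathbf{E}_0 = 0$ and $-\nabla^2\mathbf{E}_0 = \omega^2\mathbf{E}_0$, so that $\mathbf{E}_0 = \hat{\mathbf{e}}^0e^{i(\mathbf{k}\cdot\mathbf{x}-\omega t)}$. There
follows

$$\nabla\times(\nabla\times\mathbf{E}_1) = \omega^2\left[\mathbf{E}_1 + (\epsilon - 1)\mathbf{E}_0\right] \qquad (5.9.18)$$
from which we deduce

$$\nabla\cdot\mathbf{E}_1 = -\nabla\cdot\left[(\epsilon - 1)\mathbf{E}_0\right] \qquad (5.9.19)$$
and

$$-\nabla^2\mathbf{E}_1 + \nabla(\nabla\cdot\mathbf{E}_1) = \omega^2\left[\mathbf{E}_1 + (\epsilon - 1)\mathbf{E}_0\right]. \qquad (5.9.20)$$
Substituting (5.9.19) into (5.9.20) yields
$$(-\nabla^2 - \omega^2)\mathbf{E}_1 = \omega^2(\epsilon - 1)\mathbf{E}_0 + \nabla\left(\nabla\cdot(\epsilon - 1)\mathbf{E}_0\right). \qquad (5.9.21)$$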


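We solve (5.9.21) with the usual retarded Green's function. As $r\to\infty$,

$$\mathbf{E}_1 \to \frac{e^{ikr}}{r}\int d\mathbf{x}'\,e^{-i\mathbf{k}_f\cdot\mathbf{x}'}\left[\omega^2\,\frac{\epsilon - 1}{4\pi}\,\mathbf{E}_0(\mathbf{x}') + \nabla'\left(\nabla'\cdot\left(\frac{\epsilon - 1}{4\pi}\,\mathbf{E}_0(\mathbf{x}')\right)\right)\right] \qquad (5.9.22)$$
or

$$\mathbf{E}_1 = \omega^2\,\frac{e^{ikr}}{r}\int d\mathbf{x}'\,e^{-i\mathbf{k}_f\cdot\mathbf{x}'}\left(\frac{\epsilon - 1}{4\pi}\right)e^{i\mathbf{k}\cdot\mathbf{x}'}\left(\hat{\mathbf{e}}^0 - \hat{\mathbf{k}}_f\,\hat{\mathbf{k}}_f\cdot\hat{\mathbf{e}}^0\right), \qquad (5.9.23)$$
yielding a scattering amplitude
$$\mathbf{f} = \omega^2\int d\mathbf{x}'\,e^{i(\mathbf{k}-\mathbf{k}_f)\cdot\mathbf{x}'}\,\chi(\mathbf{x}')\,\hat{\mathbf{e}}^0_T \qquad (5.9.24)$$
where $\mathbf{k}_f = k\hat{\mathbf{r}}$, $\hat{\mathbf{e}}^0_T = \hat{\mathbf{e}}^0 - \hat{\mathbf{k}}_f(\hat{\mathbf{k}}_f\cdot\hat{\mathbf{e}}^0)$, and $\chi(\mathbf{x}')$ is the dielectric susceptibility at the point $\mathbf{x}'$, $\chi = (\epsilon - 1)/4\pi$.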
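As an illustration of (5.9.24), consider a uniform sphere of radius $R$ (an assumed example, not worked in the text at this point). The volume integral then reduces to the form factor $\int_{r<R} d\mathbf{x}'\,e^{i\mathbf{q}\cdot\mathbf{x}'} = 4\pi(\sin qR - qR\cos qR)/q^3$ with $q = |\mathbf{k}-\mathbf{k}_f| = 2k\sin(\theta/2)$. The sketch below compares this closed form with a Simpson-rule radial integration and evaluates the forward amplitude; all function names and parameter values are hypothetical.

```python
import math

def sphere_form_factor(q, R):
    """Integral of exp(i q.x) over a uniform sphere of radius R."""
    x = q * R
    if x < 1e-4:  # series avoids cancellation near the forward direction
        return 4.0 * math.pi * R ** 3 / 3.0 * (1.0 - x * x / 10.0)
    return 4.0 * math.pi * (math.sin(x) - x * math.cos(x)) / q ** 3

def form_factor_numeric(q, R, n=2000):
    """Simpson's rule for 4 pi * int_0^R r^2 sin(qr)/(qr) dr (n must be even)."""
    def integrand(r):
        x = q * r
        sinc = math.sin(x) / x if x != 0.0 else 1.0
        return 4.0 * math.pi * r * r * sinc

    h = R / n
    total = integrand(0.0) + integrand(R)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * integrand(i * h)
    return total * h / 3.0

def born_amplitude(omega, theta, eps, R):
    """Coefficient of e_T in f for a uniform sphere: omega^2 * chi * form factor."""
    chi = (eps - 1.0) / (4.0 * math.pi)
    q = 2.0 * omega * math.sin(0.5 * theta)
    return omega ** 2 * chi * sphere_form_factor(q, R)

print(sphere_form_factor(2.0, 1.5), form_factor_numeric(2.0, 1.5))
print(born_amplitude(0.5, 0.0, 1.02, 2.0))
```

In the forward direction the amplitude reduces to $\omega^2(\epsilon-1)R^3/3$, the long-wavelength value quoted later for the dielectric sphere.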


5.10. VECTOR MULTIPOLES 


We come next to the partial wave expansion for a vector field. As re- 
marked earlier, the method is essential for a large class of problems. We 
therefore take it up, even though the required algebra is quite complicated 
(but elementary). The prerequisite groundwork was done in Appendix B
and in Section 5.5.

The mathematics is based on a nontrivial operator dyadic identity in
three dimensions:

$$\nabla\frac{1}{\nabla^2}\nabla + \mathbf{L}\frac{1}{L^2}\mathbf{L} - \nabla\times\mathbf{L}\,\frac{1}{\nabla^2L^2}\,\mathbf{L}\times\nabla = \mathbf{1},$$

where $\mathbf{1}$ is the unit dyadic and $\mathbf{L}$ the quantum mechanical angular momentum
operator:

$$\mathbf{L} = \mathbf{r}\times\frac{\nabla}{i}.$$

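A vector field can be expanded as

$$\mathbf{V} = \nabla\psi_1 + \mathbf{L}\psi_2 + \frac{\nabla\times\mathbf{L}}{i}\,\psi_3. \qquad (5.10.1)$$

To see that this is correct, we note that we can always expand
$$\mathbf{V} = \nabla\psi_1 + \nabla\times\mathbf{Q} \qquad (5.10.2)$$
for some $\psi_1$ and $\mathbf{Q}$. Equation (5.10.1) prescribes
$$\mathbf{Q} = i\mathbf{r}\psi_2 - i\mathbf{L}\psi_3 - \nabla\psi_4 \qquad (5.10.3)$$

where $\psi_4$ does not affect $\mathbf{V}$. We now show that any vector function $\mathbf{Q}$ can
be expanded in the form (5.10.3). We introduce spherical coordinates $r$,
$\theta$, and $\varphi$ and orthogonal unit vectors $\hat{\mathbf{r}}$, $\hat{\boldsymbol\theta}$, $\hat{\boldsymbol\varphi}$ such that $\hat{\mathbf{r}}\times\hat{\boldsymbol\theta} = \hat{\boldsymbol\varphi}$,
$\hat{\boldsymbol\theta}\times\hat{\boldsymbol\varphi} = \hat{\mathbf{r}}$, and $\hat{\boldsymbol\varphi}\times\hat{\mathbf{r}} = \hat{\boldsymbol\theta}$. The gradient operator is

$$\nabla = \hat{\mathbf{r}}\,\frac{\partial}{\partial r} + \hat{\boldsymbol\theta}\,\frac{1}{r}\frac{\partial}{\partial\theta} + \hat{\boldsymbol\varphi}\,\frac{1}{r\sin\theta}\frac{\partial}{\partial\varphi} \qquad (5.10.4)$$
and
$$\mathbf{L} = \frac{1}{i}\,\mathbf{r}\times\nabla = \frac{1}{i}\left(\hat{\boldsymbol\varphi}\,\frac{\partial}{\partial\theta} - \hat{\boldsymbol\theta}\,\frac{1}{\sin\theta}\frac{\partial}{\partial\varphi}\right). \qquad (5.10.5)$$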


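The expansion in question is then, with $\mathbf{Q} = \hat{\mathbf{r}}Q_r + \hat{\boldsymbol\theta}Q_\theta + \hat{\boldsymbol\varphi}Q_\varphi$ and the $Q_i$
arbitrary single-valued functions,

$$Q_r = ir\psi_2 - \frac{\partial\psi_4}{\partial r}, \qquad (5.10.6)$$
$$Q_\theta = \frac{1}{\sin\theta}\frac{\partial\psi_3}{\partial\varphi} - \frac{1}{r}\frac{\partial\psi_4}{\partial\theta}, \qquad (5.10.7)$$
and
$$Q_\varphi = -\frac{\partial\psi_3}{\partial\theta} - \frac{1}{r\sin\theta}\frac{\partial\psi_4}{\partial\varphi}. \qquad (5.10.8)$$

Clearly, one must choose $\psi_3$ and $\psi_4$ to satisfy (5.10.7) and (5.10.8).
$\psi_2$ is then chosen to satisfy (5.10.6). So, our problem is to show that $\psi_3$
and $\psi_4$ can be found.

We multiply (5.10.7) and (5.10.8) by $\sin\theta$. There results

$$P_\theta = -\sin\theta\,\frac{\partial\tilde\psi_4}{\partial\theta} + \frac{\partial\psi_3}{\partial\varphi} \qquad (5.10.9)$$
$$P_\varphi = -\frac{\partial\tilde\psi_4}{\partial\varphi} - \sin\theta\,\frac{\partial\psi_3}{\partial\theta} \qquad (5.10.10)$$
where
$$\tilde\psi_4 = \frac{\psi_4}{r}$$

and where the $P_i = \sin\theta\,Q_i$ are still arbitrary and single-valued. Since the
operators $\sin\theta\,(\partial/\partial\theta)$ and $\partial/\partial\varphi$ commute with each other, (5.10.9) and
(5.10.10) can be solved algebraically:

$$\tilde\psi_4 = -\frac{1}{\sin\theta\frac{\partial}{\partial\theta}\sin\theta\frac{\partial}{\partial\theta} + \frac{\partial^2}{\partial\varphi^2}}\left(\sin\theta\,\frac{\partial P_\theta}{\partial\theta} + \frac{\partial P_\varphi}{\partial\varphi}\right) \qquad (5.10.11)$$
and
$$\psi_3 = -\frac{1}{\sin\theta\frac{\partial}{\partial\theta}\sin\theta\frac{\partial}{\partial\theta} + \frac{\partial^2}{\partial\varphi^2}}\left(\sin\theta\,\frac{\partial P_\varphi}{\partial\theta} - \frac{\partial P_\theta}{\partial\varphi}\right). \qquad (5.10.12)$$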
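We factor out $\sin^2\theta$:

$$\frac{1}{\sin\theta\frac{\partial}{\partial\theta}\sin\theta\frac{\partial}{\partial\theta} + \frac{\partial^2}{\partial\varphi^2}} = \frac{1}{\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\sin\theta\frac{\partial}{\partial\theta} + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\varphi^2}}\;\frac{1}{\sin^2\theta}$$

so that
$$\tilde\psi_4 = \frac{1}{L^2}\left(\frac{1}{\sin\theta}\frac{\partial P_\theta}{\partial\theta} + \frac{1}{\sin^2\theta}\frac{\partial P_\varphi}{\partial\varphi}\right) \qquad (5.10.13)$$
and
$$\psi_3 = \frac{1}{L^2}\left(\frac{1}{\sin\theta}\frac{\partial P_\varphi}{\partial\theta} - \frac{1}{\sin^2\theta}\frac{\partial P_\theta}{\partial\varphi}\right), \qquad (5.10.14)$$

where we recognize the form

$$\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\sin\theta\frac{\partial}{\partial\theta} + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\varphi^2}$$

as the operator $-L^2$, with

$$L^2 = r^2\left(-\nabla^2 + \frac{\partial^2}{\partial r^2} + \frac{2}{r}\frac{\partial}{\partial r}\right).$$

Of course, one here obtains $\psi_3$ and $\tilde\psi_4$ as a sum of spherical harmonics.
Note that the operator $1/L^2 = 1/\ell(\ell + 1)$ does not become singular since
the two functions in parentheses in (5.10.13) and (5.10.14) have no $\ell = 0$
projection, and $P_\theta$ and $P_\varphi$ vanish at $\theta = 0$ and $\pi$.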

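We return now to the electric field. We consider first radiation by a
prescribed current. The radiation equations for $\mathbf{E}$ may be obtained directly
from Maxwell's equations:

$$\nabla\times\mathbf{E} = i\omega\mathbf{B}, \qquad \nabla\times\mathbf{B} = -i\omega\mathbf{E} + 4\pi\mathbf{j}, \qquad\text{and}\qquad \nabla\cdot\mathbf{E} = 4\pi\rho$$

so

$$\nabla\times\left(\frac{\nabla\times\mathbf{E}}{i\omega}\right) = -i\omega\mathbf{E} + 4\pi\mathbf{j}$$

and

$$\nabla^2\mathbf{E} + \omega^2\mathbf{E} = -4\pi\left[i\omega\mathbf{j} - \nabla\rho\right], \qquad (5.10.15)$$

together with the continuity equation, $\nabla\cdot\mathbf{j} = i\omega\rho$.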




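We solve directly for the retarded field:

$$\mathbf{E} = \int d\mathbf{x}'\,\frac{e^{ik|\mathbf{x}-\mathbf{x}'|}}{|\mathbf{x}-\mathbf{x}'|}\left[i\omega\mathbf{j}(\mathbf{x}') - \nabla'\rho(\mathbf{x}')\right], \qquad (5.10.16)$$

with the large $r$ form

$$\mathbf{E} = \frac{e^{ikr}}{r}\int d\mathbf{x}'\,e^{-ik\hat{\mathbf{r}}\cdot\mathbf{x}'}\left[i\omega\mathbf{j}(\mathbf{x}') - \nabla'\rho(\mathbf{x}')\right] + O\!\left(\frac{1}{r^2}\right). \qquad (5.10.17)$$

We now apply the expansion (5.10.1) to (5.10.16):
$$\mathbf{E} = \nabla\psi_L + \mathbf{L}\psi_M + \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_E \qquad (5.10.18)$$

where L stands for longitudinal, M for magnetic, and E for electric. The
normalization factor $\omega$ is inserted for convenience.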
We solve for the $\psi$'s by three orthogonal projections. First,

$$\nabla^2\psi_L = \nabla\cdot\mathbf{E}. \qquad (5.10.19)$$

Since $\nabla\cdot\mathbf{E} = 4\pi\rho$, we can solve (5.10.19) for $\psi_L$:

$$\psi_L = -\int d\mathbf{x}'\,\frac{\rho(\mathbf{x}')}{|\mathbf{x}-\mathbf{x}'|}, \qquad (5.10.20)$$

which has inverse power law behavior at large $r$ and is instantaneously
related to the charge density $\rho$. Of course, the electric field itself does not
have such acausal behavior since the factor $e^{ikr}$ in (5.10.17) guarantees
proper retardation. This power behavior reflects the use of the potentials
$\psi$ and must disappear (by cancellation) in a calculation of the field itself.
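We solve for $\psi_M$ by projecting with $\mathbf{L}$. We note that

$$\mathbf{L}\cdot\nabla = \frac{\mathbf{r}\times\nabla}{i}\cdot\nabla = \frac{\mathbf{r}}{i}\cdot\nabla\times\nabla = 0$$

and since $\nabla\times\mathbf{L} = -\mathbf{L}\times\nabla - (2\nabla/i)$,

$$\mathbf{L}\cdot\nabla\times\mathbf{L} = -\mathbf{L}\cdot\mathbf{L}\times\nabla = -\mathbf{L}\times\mathbf{L}\cdot\nabla = -i\mathbf{L}\cdot\nabla = 0,$$
and
$$L^2\psi_M = \mathbf{L}\cdot\mathbf{E}. \qquad (5.10.21)$$
Thus,

$$\psi_M = \frac{1}{L^2}\,\mathbf{L}\cdot\mathbf{E} \qquad (5.10.22)$$

provides an expansion in spherical harmonics for $\psi_M$. Note again that the
factor $\mathbf{L}$ in $\mathbf{L}\cdot\mathbf{E}$ precludes any problem with $\ell = 0$.

Finally, $\psi_E$ is projected with $\mathbf{L}\times\nabla$, so that

$$\mathbf{L}\times\nabla\cdot\mathbf{E} = \mathbf{L}\times\nabla\cdot\frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_E = -\frac{\nabla^2L^2}{i\omega}\,\psi_E, \qquad \psi_E = -\frac{i\omega}{\nabla^2L^2}\,\mathbf{L}\times\nabla\cdot\mathbf{E}. \qquad (5.10.23)$$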

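The magnetic field is given by a formula very similar to (5.10.18):

$$\mathbf{B} = \frac{\nabla\times\mathbf{E}}{i\omega} = \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_M + \frac{\nabla\times(\nabla\times\mathbf{L})}{-\omega^2}\,\psi_E \qquad (5.10.24)$$
$$\phantom{\mathbf{B}} = \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_M + \frac{\nabla^2}{\omega^2}\,\mathbf{L}\psi_E. \qquad (5.10.25)$$

Outside the charge distribution,

$$\nabla^2 = -\omega^2 \quad\text{and}\quad \mathbf{B} = \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_M - \mathbf{L}\psi_E. \qquad (5.10.26)$$

Note the generalization of the relation (4.5.20) between electric and
magnetic dipole radiation. We interchange electric and magnetic radiation
by the transformation $\psi_E \to \psi_M$ and $\psi_M \to -\psi_E$, which produces $\mathbf{E}' = \mathbf{B}$
and $\mathbf{B}' = -\mathbf{E}$.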

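We next calculate the potentials from (5.10.19), (5.10.22), and
(5.10.23). We already know from (5.10.20) that

$$\psi_L = -\int d\mathbf{r}'\,\frac{\rho(\mathbf{r}')}{|\mathbf{r}-\mathbf{r}'|}. \qquad (5.10.20)$$

For $\psi_M$, we have from (5.10.22)

$$\psi_M = \frac{1}{L^2}\,\mathbf{L}\cdot\mathbf{E} \qquad (5.10.22)$$
$$\phantom{\psi_M} = \frac{\omega}{L^2}\int d\mathbf{r}'\,\frac{e^{ik|\mathbf{r}-\mathbf{r}'|}}{|\mathbf{r}-\mathbf{r}'|}\;\mathbf{r}'\times\nabla'\cdot\mathbf{j}(\mathbf{r}') \qquad (5.10.27)$$
$$\phantom{\psi_M} = -\frac{\omega}{L^2}\int d\mathbf{r}'\,\frac{e^{ik|\mathbf{r}-\mathbf{r}'|}}{|\mathbf{r}-\mathbf{r}'|}\;\nabla'\cdot\left[\mathbf{r}'\times\mathbf{j}(\mathbf{r}')\right]. \qquad (5.10.28)$$

We recognize the source of magnetic radiation as $-\nabla\cdot\mathbf{M}$, where $\mathbf{M}$
is the magnetic moment per unit volume. If we call $-\nabla'\cdot[\mathbf{r}'\times\mathbf{j}(\mathbf{r}')]/2 = \rho_M(\mathbf{r}')$,
we have

$$\psi_M = \frac{2\omega}{L^2}\int d\mathbf{r}'\,\frac{e^{ik|\mathbf{r}-\mathbf{r}'|}}{|\mathbf{r}-\mathbf{r}'|}\,\rho_M(\mathbf{r}') \qquad (5.10.29)$$
so that a $Y_{\ell m}$ expansion will give us $\psi_M$ for all $r$ outside the source, once
we know the asymptotic form. We use (5.5.13), which gives

$$\psi_M = 8\pi i\omega^2\sum_{\ell m}h_\ell(kr)\,\frac{Y_{\ell m}(\Omega)}{\ell(\ell+1)}\int d\mathbf{r}'\,Y^*_{\ell m}(\Omega')\,j_\ell(kr')\,\rho_M(\mathbf{r}') \qquad (5.10.30)$$

for $r$ outside the source.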
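For $\psi_E$, we have from (5.10.23)

$$\psi_E = -\frac{i\omega}{\nabla^2L^2}\,\mathbf{L}\times\nabla\cdot\mathbf{E} \qquad (5.10.23)$$
$$\phantom{\psi_E} = \frac{1}{\nabla^2}\,\frac{\omega}{iL^2}\int d\mathbf{r}'\,\frac{e^{ik|\mathbf{r}-\mathbf{r}'|}}{|\mathbf{r}-\mathbf{r}'|}\;\mathbf{L}'\times\nabla'\cdot\left(i\omega\mathbf{j}(\mathbf{r}') - \nabla'\rho(\mathbf{r}')\right). \qquad (5.10.31)$$

Here again, we find a long-range instantaneous interaction arising
from the Coulomb operator $1/\nabla^2$ in (5.10.31). Since (5.10.16) shows that
such terms are absent in the electric and magnetic fields outside the
source, we may set $1/\nabla^2 = -1/\omega^2$ in (5.10.31), confident that the residual
Coulomb-like term will cancel against the field generated from $\psi_L$
(5.10.20). The cancellation clearly depends on the identity that justifies
our calculation of $\psi_L$, $\psi_M$, and $\psi_E$. That is that the dyadic

$$\nabla\frac{1}{\nabla^2}\nabla + \mathbf{L}\frac{1}{L^2}\mathbf{L} - \nabla\times\mathbf{L}\,\frac{1}{\nabla^2L^2}\,\mathbf{L}\times\nabla = \mathbf{1} \qquad (5.10.32)$$

where $\mathbf{1}$ is the unit dyadic. This identity is not at all obvious and is difficult
to prove directly, but must be true in view of the completeness and
uniqueness of the representation (5.10.1), subject to the usual boundary
conditions.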
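We continue to work on (5.10.31). Since

$$\frac{1}{\nabla^2} = -\frac{1}{\omega^2} + \frac{\omega^2 + \nabla^2}{\omega^2\nabla^2} \qquad (5.10.33)$$
and
$$(\omega^2 + \nabla^2)\,\frac{e^{ikr}}{r} = -4\pi\delta(\mathbf{r}) \qquad (5.10.34)$$
(5.10.31) becomes
$$\psi_E = \psi_{E_1} + \psi_{E_2} \qquad (5.10.35)$$
where
$$\psi_{E_1} = -\frac{1}{L^2}\int d\mathbf{r}'\,\frac{e^{ik|\mathbf{r}-\mathbf{r}'|}}{|\mathbf{r}-\mathbf{r}'|}\;\mathbf{L}'\times\nabla'\cdot\mathbf{j}(\mathbf{r}') \qquad (5.10.36)$$
and
$$\psi_{E_2} = -\frac{4\pi}{L^2}\,\frac{1}{\nabla^2}\,\mathbf{L}\times\nabla\cdot\mathbf{j}. \qquad (5.10.37)$$

Our result is contained in (5.10.36), the retarded field. The electric field
generated by $\psi_{E_2}$ must cancel that of $\psi_L$ outside the source. That is, we
must have

$$\nabla\psi_L(\mathbf{r}) + \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_{E_2}(\mathbf{r}) = 0 \qquad (5.10.38)$$

for $r$ outside of the source, with $\psi_L$ and $\psi_{E_2}$ given by (5.10.20) and
(5.10.37), respectively.
We recall, for a localized function $f$,

$$\frac{1}{\nabla^2}\,f(\mathbf{r}) = -\frac{1}{4\pi}\int d\mathbf{r}'\,\frac{f(\mathbf{r}')}{|\mathbf{r}-\mathbf{r}'|} \qquad (5.10.39)$$
so that
$$\psi_{E_2} = \frac{1}{L^2}\int\frac{d\mathbf{r}'}{|\mathbf{r}-\mathbf{r}'|}\;\mathbf{L}'\times\nabla'\cdot\mathbf{j}(\mathbf{r}') \qquad (5.10.40)$$

and for $r > r'$
$$\psi_{E_2} = \sum_{\ell m}\frac{4\pi}{2\ell+1}\,\frac{1}{\ell(\ell+1)}\,\frac{Y_{\ell m}(\Omega)}{r^{\ell+1}}\int d\mathbf{r}'\,r'^{\ell}\,Y^*_{\ell m}(\Omega')\;\mathbf{L}'\times\nabla'\cdot\mathbf{j}(\mathbf{r}'). \qquad (5.10.41)$$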
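We expand

$$\mathbf{L}'\times\nabla'\cdot\mathbf{j} = \frac{1}{i}\,(\mathbf{r}'\times\nabla')\times\nabla'\cdot\mathbf{j}(\mathbf{r}') = \frac{1}{i}\left[-\nabla'^2\mathbf{r}' + 2\nabla' + \mathbf{r}'\cdot\nabla'\,\nabla'\right]\cdot\mathbf{j}(\mathbf{r}'). \qquad (5.10.42)$$

We substitute $\nabla'\cdot\mathbf{j}(\mathbf{r}') = i\omega\rho(\mathbf{r}')$, integrate (5.10.41) by parts, and remember
that $\nabla'^2r'^{\ell}Y^*_{\ell m}(\Omega') = 0$. There results

$$\psi_{E_2} = -\omega\sum_{\ell m}\frac{4\pi}{2\ell+1}\,\frac{\ell+1}{\ell(\ell+1)}\,\frac{Y_{\ell m}(\Omega)}{r^{\ell+1}}\int d\mathbf{r}'\,r'^{\ell}\,Y^*_{\ell m}(\Omega')\,\rho(\mathbf{r}')$$
$$\phantom{\psi_{E_2}} = -\omega\sum_{\ell m}\frac{4\pi}{2\ell+1}\,\frac{1}{\ell}\,\frac{Y_{\ell m}(\Omega)}{r^{\ell+1}}\int d\mathbf{r}'\,r'^{\ell}\,Y^*_{\ell m}(\Omega')\,\rho(\mathbf{r}'). \qquad (5.10.43)$$

Note here that the coefficient of $1/\ell$ vanishes at $\ell = 0$, so there is no
singularity at $\ell = 0$. Since the electric and magnetic fields are generated
from $\psi_{E_2}$ by acting with $(\nabla\times\mathbf{L})/i\omega$ and $\nabla^2\mathbf{L}$, respectively, the $\ell = 0$ component
of $\psi_{E_2}$ makes no contribution to either. Note also that the magnetic
field outside of the source generated by $\psi_{E_2}$ vanishes, since $\nabla^2\psi_{E_2} = 0$
there. We turn finally to (5.10.38):

$$\nabla\psi_L + \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_{E_2} = \nabla\psi_L + \frac{1}{\omega}\,\nabla\left(1 + r\frac{\partial}{\partial r}\right)\psi_{E_2} = 0 \qquad (5.10.44)$$

where we have again made use of the equation $\nabla^2\psi_{E_2} = 0$ for $r$ outside of
the source. We find thus that the fields generated by $\psi_L$ and $\psi_{E_2}$ cancel
outside of the source; only $\psi_{E_1}$ and $\psi_M$ contribute. This is not the case in
the source, as we shall see in Section 5.12.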

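Our final result for $\psi_{E_1}$ is then

$$\psi_{E_1} = -\frac{1}{L^2}\int d\mathbf{r}'\,\frac{e^{ik|\mathbf{r}-\mathbf{r}'|}}{|\mathbf{r}-\mathbf{r}'|}\;\mathbf{L}'\times\nabla'\cdot\mathbf{j}(\mathbf{r}'); \qquad (5.10.36)$$

expanded in spherical harmonics, (5.10.36) becomes

$$\psi_{E_1} = -4\pi i\omega\sum_{\ell m}\frac{1}{\ell(\ell+1)}\,h_\ell(kr)\,Y_{\ell m}(\Omega)\int d\mathbf{r}'\,j_\ell(kr')\,Y^*_{\ell m}(\Omega')\;\mathbf{L}'\times\nabla'\cdot\mathbf{j}(\mathbf{r}'). \qquad (5.10.45)$$

It is again the case that the asymptotic $e^{ikr}/r$ coefficient of each $Y_{\ell m}$
produces a known $r$ dependence, $h_\ell(kr)$, as long as we stay outside of the
source.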

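We can see why $\psi_E$ is called electric by considering the small $kr'$
behavior of the integrand in (5.10.45). The algebra is almost identical to
that leading to (5.10.43). The source function is

$$\mathbf{L}'\times\nabla'\cdot\mathbf{j}(\mathbf{r}') = \frac{1}{i}\,(\mathbf{r}'\times\nabla')\times\nabla'\cdot\mathbf{j}$$
$$= \frac{1}{i}\left(\mathbf{r}'\cdot\nabla'\,\nabla' - \mathbf{r}'\nabla'^2\right)\cdot\mathbf{j}$$
$$= \frac{1}{i}\left(\mathbf{r}'\cdot\nabla'\,\nabla' - \nabla'^2\mathbf{r}' + 2\nabla'\right)\cdot\mathbf{j}$$
$$= \frac{1}{i}\left((\mathbf{r}'\cdot\nabla' + 2)\,\nabla'\cdot\mathbf{j} - \nabla'^2\,\mathbf{r}'\cdot\mathbf{j}\right).$$
Finally,
$$\mathbf{L}'\times\nabla'\cdot\mathbf{j}(\mathbf{r}') = \omega\left[(\mathbf{r}'\cdot\nabla' + 2)\rho\right] - \frac{\nabla'^2\,\mathbf{r}'\cdot\mathbf{j}}{i}. \qquad (5.10.46)$$

Now integrate (5.10.45) by parts, keeping only the $(kr')^\ell$ term in $j_\ell(kr')$.
Since $\nabla'^2r'^{\ell}Y^*_{\ell m}(\Omega') = 0$, the last term in (5.10.46) makes no contribution.
The first term gives

$$\left(-3 + 2 - r'\frac{\partial}{\partial r'}\right)r'^{\ell} = -(\ell + 1)\,r'^{\ell}$$

so that the $\ell, m$ amplitude is given in this approximation by
$$\int d\mathbf{r}'\,\rho(\mathbf{r}')\,r'^{\ell}\,Y^*_{\ell m}(\Omega'),$$

appropriate to an electric multipole. Of course, the labels electric and
magnetic are only labels. The importance of the scalar functions $\psi_E$ and
$\psi_M$ is that they permit a spherical harmonic expansion of the vector field
radiation to be made. We shall see, just as in the scalar case, that the
$(\ell, m)$th multipole has simple angular momentum properties.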
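5.11. ENERGY AND ANGULAR MOMENTUM


We define the multipole amplitudes as

$$\psi_M \to \frac{e^{ikr}}{r}\sum_{\ell m}Y_{\ell m}\,a^M_{\ell m} \quad\text{and}\quad \psi_E \to \frac{e^{ikr}}{r}\sum_{\ell m}Y_{\ell m}\,a^E_{\ell m}.$$

The average flux of energy is the Poynting vector,

$$\bar{\mathbf{P}} = \frac{1}{2\pi}\,{\rm Re}\;\mathbf{E}^*\times\mathbf{B}.$$
With
$$\mathbf{E} = \mathbf{L}\psi_M + \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_E \qquad (5.11.1)$$
$$\mathbf{B} = \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_M - \mathbf{L}\psi_E \qquad (5.11.2)$$

the radiated power is then

$$\frac{dW}{dt} = \oint\bar{\mathbf{P}}\cdot d\mathbf{S} = \frac{1}{2\pi}\,{\rm Re}\oint\left(\mathbf{L}\psi_M + \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_E\right)^*\times\left(\frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_M - \mathbf{L}\psi_E\right)\cdot d\mathbf{S}. \qquad (5.11.3)$$

Since we only need the constant (as $r\to\infty$) term in (5.11.3), we may set
$\nabla = ik\hat{\mathbf{r}}$, so that

$$\frac{dW}{dt} = \frac{1}{2\pi}\,{\rm Re}\int\left(\mathbf{L}\psi_M + \hat{\mathbf{r}}\times\mathbf{L}\psi_E\right)^*\times\left(\hat{\mathbf{r}}\times\mathbf{L}\psi_M - \mathbf{L}\psi_E\right)\cdot\hat{\mathbf{r}}\;r^2\,d\Omega. \qquad (5.11.4)$$
The purely magnetic term is

$$\frac{dW_M}{dt} = \frac{1}{2\pi}\,{\rm Re}\int r^2\,d\Omega\;(\mathbf{L}\psi_M)^*\times(\hat{\mathbf{r}}\times\mathbf{L}\psi_M)\cdot\hat{\mathbf{r}}$$
$$\phantom{\frac{dW_M}{dt}} = \frac{1}{2\pi}\int(\mathbf{L}\psi_M)^*\cdot(\mathbf{L}\psi_M)\;r^2\,d\Omega \qquad (5.11.5)$$
so
$$\frac{dW_M}{dt} = \frac{1}{2\pi}\sum_{\ell m}\ell(\ell+1)\,|a^M_{\ell m}|^2. \qquad (5.11.6)$$
Similarly,
$$\frac{dW_E}{dt} = \frac{1}{2\pi}\sum_{\ell m}\ell(\ell+1)\,|a^E_{\ell m}|^2. \qquad (5.11.7)$$

The cross-terms between $\psi_E$ and $\psi_M$, when integrated over $\Omega$, give zero.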
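We turn next to the radiation of angular momentum. Recall from
(3.8.21) that the outgoing flux of angular momentum through a surface is

$$F_i = -\epsilon_{ijk}\int dS_\ell\,x_j\,T_{k\ell} \qquad (5.11.8)$$

where $T_{k\ell}$ is the Maxwell stress tensor

$$T_{k\ell} = \frac{1}{4\pi}\left(E_kE_\ell + B_kB_\ell - \frac{1}{2}\,\delta_{k\ell}E^2 - \frac{1}{2}\,\delta_{k\ell}B^2\right). \qquad (5.11.9)$$

The terms with $\delta_{k\ell}$ are orthogonal to the spherical surface, so there
remains, in vector notation (time-averaged),

$$\mathbf{F} = \frac{1}{2\pi}\,{\rm Re}\int d\Omega\;r\left(\mathbf{r}\cdot\mathbf{E}^*\;\mathbf{E}\times\mathbf{r} + \mathbf{r}\cdot\mathbf{B}^*\;\mathbf{B}\times\mathbf{r}\right). \qquad (5.11.10)$$

Note that $\mathbf{F}$ appears to grow linearly with $r$, since the $\mathbf{E}$ and $\mathbf{B}$ fields go
inversely with $r$. Hence, there must be a cancellation of one power of $r$.
With
$$\mathbf{E} = \mathbf{L}\psi_M + \frac{\nabla}{i\omega}\times\mathbf{L}\psi_E,$$
and
$$\mathbf{B} = \frac{\nabla}{i\omega}\times\mathbf{L}\psi_M - \mathbf{L}\psi_E, \qquad (5.11.11)$$
(since $\psi_{E_2}$ and $\psi_L$ cancel away from the source)

$$\mathbf{r}\cdot\mathbf{E} = \frac{L^2}{\omega}\,\psi_E \qquad (5.11.12)$$
and

$$\mathbf{r}\cdot\mathbf{B} = \frac{L^2}{\omega}\,\psi_M \qquad (5.11.13)$$

and the cancellation has occurred.
There remain $\mathbf{E}\times\mathbf{r}$ and $\mathbf{B}\times\mathbf{r}$:

$$\mathbf{E}\times\mathbf{r} = -\left[\mathbf{r}\times\mathbf{L}\psi_M + \mathbf{r}\times\left(\frac{\nabla}{i\omega}\times\mathbf{L}\right)\psi_E\right] \qquad (5.11.14)$$

and

$$\mathbf{B}\times\mathbf{r} = -\left[\mathbf{r}\times\left(\frac{\nabla}{i\omega}\times\mathbf{L}\right)\psi_M - \mathbf{r}\times\mathbf{L}\psi_E\right]. \qquad (5.11.15)$$

Again, the gradients must act on $e^{ikr}$ to give $ik\hat{\mathbf{r}}$, so

$$\mathbf{E}\times\mathbf{r} = -\left(\mathbf{r}\times\mathbf{L}\psi_M + \mathbf{r}\times(\hat{\mathbf{r}}\times\mathbf{L})\psi_E\right) \qquad (5.11.16)$$

and

$$\mathbf{B}\times\mathbf{r} = -\left(\mathbf{r}\times(\hat{\mathbf{r}}\times\mathbf{L})\psi_M - \mathbf{r}\times\mathbf{L}\psi_E\right). \qquad (5.11.17)$$
Combining, we find for purely electric radiation an angular momentum

$$\mathbf{F}_E = \frac{1}{2\pi\omega}\sum_{\ell,m,m'}\ell(\ell+1)\,a^{E*}_{\ell m'}\,\mathbf{L}_{m'm}\,a^E_{\ell m} \qquad (5.11.18)$$

and for purely magnetic radiation

$$\mathbf{F}_M = \frac{1}{2\pi\omega}\sum_{\ell,m,m'}\ell(\ell+1)\,a^{M*}_{\ell m'}\,\mathbf{L}_{m'm}\,a^M_{\ell m} \qquad (5.11.19)$$

where $\mathbf{L}_{m'm}$ is the quantum mechanical matrix element of the operator
$\mathbf{L}$ between the states $m'$ and $m$:

$$\mathbf{L}_{m'm} = \int d\Omega\;Y^*_{\ell m'}\,\mathbf{L}\,Y_{\ell m}. \qquad (5.11.20)$$

Note that different $\ell$ values do not interfere. Since $(L_z)_{mm'} = m\,\delta_{mm'}$, the
z-component of angular momentum is particularly simple. For a single
$\ell, m$ multipole, it is
$$F_z = \frac{1}{2\pi\omega}\,\ell(\ell+1)\,m\,|a_{\ell m}|^2 \qquad (5.11.21)$$

and the ratio of the z-component of angular momentum radiated to energy
radiated is

$$\frac{F_z}{dW/dt} = \frac{m}{\omega} \qquad (5.11.22)$$

which suggests that the wave described by $a_{\ell m}$ would in quantum theory
carry a z-component of angular momentum $\hbar m$. The suggestion is correct.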
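The ratio (5.11.22) can be illustrated with a short calculation. The sketch below assumes the forms $dW/dt = (1/2\pi)\sum\ell(\ell+1)|a_{\ell m}|^2$ and $F_z = (1/2\pi\omega)\sum\ell(\ell+1)\,m\,|a_{\ell m}|^2$ for multipoles of a single type (electric or magnetic); the dictionary layout and the sample amplitudes are hypothetical.

```python
import math

def radiated_power(amplitudes):
    """dW/dt = (1/2 pi) sum l(l+1) |a_lm|^2; keys are (l, m), values a_lm."""
    return sum(l * (l + 1) * abs(a) ** 2 for (l, m), a in amplitudes.items()) / (2.0 * math.pi)

def radiated_lz(amplitudes, omega):
    """F_z = (1 / 2 pi omega) sum l(l+1) m |a_lm|^2 for the same amplitudes."""
    total = sum(l * (l + 1) * m * abs(a) ** 2 for (l, m), a in amplitudes.items())
    return total / (2.0 * math.pi * omega)

omega = 2.5
single = {(2, 1): 0.3 - 0.4j}                      # one (l, m) multipole
mixture = {(1, 1): 1.0 + 0.0j, (1, -1): 1.0 + 0.0j}  # equal m = +1 and m = -1

print(radiated_lz(single, omega) / radiated_power(single))    # m / omega
print(radiated_lz(mixture, omega) / radiated_power(mixture))  # 0 for a balanced mix
```

A balanced superposition of $m = +1$ and $m = -1$, as produced by linearly polarized radiation, carries energy but no net $z$ angular momentum.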

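The mixed EM fluxes are not so simple. They involve integrals of the
form

$$\mathbf{F}_{EM} = \int\left[\frac{L^2}{\omega}\,\psi_E^*\;\mathbf{r}\times\mathbf{L}\psi_M - \frac{L^2}{\omega}\,\psi_M^*\;\mathbf{r}\times\mathbf{L}\psi_E\right]d\Omega \qquad (5.11.23)$$

which connect $a^M_\ell$'s and $a^E_{\ell\pm 1}$'s.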


5.12. MULTIPOLE SCATTERING BY A DIELECTRIC 


The expansion of a field amplitude in electric and magnetic multipoles of
given $\ell$ makes it possible to reduce three-dimensional scattering problems
to one radial dimension for each $\ell$, provided the system is isotropic, that
is, rotationally invariant.

We illustrate this by deriving the general equations for scattering by 
an isotropic medium and carry out the calculation for the simple case of 
a uniform spherical dielectric. 

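As usual, we start with Maxwell's equations:

$$\nabla\times\mathbf{E} = i\omega\mathbf{B}, \qquad \nabla\times\mathbf{B} = -i\omega\epsilon\mathbf{E}, \qquad\text{and}\qquad \nabla\cdot\epsilon\mathbf{E} = 0,$$
which give the equation for $\mathbf{E}$

$$\nabla\times(\nabla\times\mathbf{E}) = \omega^2\epsilon\mathbf{E}$$
or, from (5.10.18),

$$\mathbf{E} = \nabla\psi_L + \mathbf{L}\psi_M + \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_E$$
so

$$\nabla\times\nabla\times\left(\mathbf{L}\psi_M + \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_E\right) = -\nabla^2\left(\mathbf{L}\psi_M + \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_E\right) = \omega^2\epsilon\left(\nabla\psi_L + \mathbf{L}\psi_M + \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_E\right) \qquad (5.12.1)$$

and

$$\nabla\cdot\epsilon\left(\nabla\psi_L + \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_E\right) = 0. \qquad (5.12.2)$$

We project the magnetic amplitude by operating on (5.12.1) with $\mathbf{L}$
dot. Since the commutator $[\mathbf{L}, \epsilon(r)] = 0$, we find

$$-\nabla^2L^2\psi_M = \omega^2\epsilon L^2\psi_M \qquad (5.12.3)$$
or

$$-\nabla^2\psi_M = \omega^2\epsilon\psi_M \qquad (5.12.4)$$

provided we leave out the undetermined $\ell = 0$ component that does not
contribute to the fields.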

Equation (5.12.4) shows that the magnetic amplitude is decoupled
from the longitudinal and electric amplitudes, and can be found
independently of the other two. Since (5.12.4) is a scalar equation, the
function $\psi_M$ can be expanded in amplitudes of definite $\ell$, just as in the
scalar case studied earlier. There is one important difference: The incoming
field has a polarization direction that produces a significant azimuthal
dependence in the scattering and requires the introduction of $Y_{\ell m}$'s for
$m \neq 0$ in the expansion. We will see this shortly when we take up scattering
by a sphere.

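We project the electric amplitude in (5.12.1) by the operator $\mathbf{L}\times\nabla$,
which gives

$$\frac{(\nabla^2)^2L^2}{i\omega}\,\psi_E = \omega^2\,\mathbf{L}\times\nabla\cdot\epsilon\left(\nabla\psi_L + \mathbf{L}\psi_M + \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_E\right)$$
or, with $\epsilon' = d\epsilon/dr$,

$$\frac{(\nabla^2)^2L^2}{i\omega}\,\psi_E = \omega^2\,\mathbf{L}\times\hat{\mathbf{r}}\cdot\epsilon'\left(\nabla\psi_L + \mathbf{L}\psi_M + \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_E\right) - \omega^2\epsilon\,\frac{\nabla^2L^2}{i\omega}\,\psi_E \qquad (5.12.5)$$
or
$$\frac{(\nabla^2)^2L^2}{i\omega}\,\psi_E + \omega^2\epsilon\,\frac{\nabla^2L^2}{i\omega}\,\psi_E = \frac{i\omega^2\epsilon'}{r}\,L^2\left(\psi_L + \frac{\mathbf{r}\cdot\nabla + 1}{\omega}\,\psi_E\right)$$
or
$$(\nabla^2)^2\psi_E + \omega^2\epsilon\nabla^2\psi_E = -\frac{\omega^3\epsilon'}{r}\left(\psi_L + \frac{\mathbf{r}\cdot\nabla + 1}{\omega}\,\psi_E\right)$$
or

$$\nabla^2(\nabla^2 + \omega^2)\psi_E = \omega^2(1 - \epsilon)\nabla^2\psi_E - \frac{\omega^3\epsilon'}{r}\left(\psi_L + \frac{\mathbf{r}\cdot\nabla + 1}{\omega}\,\psi_E\right). \qquad (5.12.6)$$

The equation for $\psi_L$ is given by (5.12.2):

$$\epsilon\nabla^2\psi_L + \frac{\epsilon'}{r}\left(\mathbf{r}\cdot\nabla\psi_L + \frac{L^2}{\omega}\,\psi_E\right) = 0. \qquad (5.12.7)$$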


Thus, $\psi_L$ and $\psi_E$ are coupled via (5.12.6) and (5.12.7); however,
since both equations are spherically symmetric, different $\ell$ values are not
coupled. The coupling of $\psi_L$ and $\psi_E$ results from the fact that the parity
of each $Y_{\ell m}$ is the same in $\psi_L$ and $\psi_E$, but opposite in $\psi_M$. Thus, the
rotational and inversion invariance of the system that requires decoupling
of the different $\ell$ values permits coupling of $\psi_L$ and $\psi_E$; since any interaction
that is not forbidden is, in general, allowed, this coupling appears.
Note that in especially simple cases, such as our example of a spherical
dielectric, with $\epsilon' = 0$ except at the surface of the sphere, this coupling
may disappear. We find again the presence of an apparent pole at $\nabla^2 = 0$
(leading to $1/r$ at infinity), which now must cancel between the actual
fields produced by $\psi_L$ and $\psi_E$.

Clearly, the general problem of scattering by a dielectric is a hard
one, and we do not address it here. We turn instead to the special case
of scattering by a uniform dielectric sphere.

The equations satisfied by the potentials follow from (5.12.1) and
(5.12.2) by setting $\epsilon$ equal to a constant: $\epsilon = 1$ outside the sphere and $\epsilon$
constant inside the sphere. Thus, from (5.12.7), $\nabla^2\psi_L = 0$, inside and outside,
and we are free to set $\psi_L = 0$ and $\psi_{E_2} = 0$ (provided we can satisfy the
appropriate boundary conditions at the surface without them). From
(5.12.4) and (5.12.6), we find

$$(\nabla^2 + \epsilon\omega^2)\psi_M = 0 \quad\text{and}\quad (\nabla^2 + \epsilon\omega^2)\psi_E = 0. \qquad (5.12.8)$$
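The boundary conditions at $r = R$ (the radius) are, as usual, $\mathbf{E}_T$ and
$\mathbf{B}_T$ continuous, $D_{\rm normal}$ and $B_{\rm normal}$ continuous.
From

$$\mathbf{E} = \mathbf{L}\psi_M + \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_E \qquad (5.12.9)$$

we look for continuity in $\mathbf{E}_T$. Since $\mathbf{E}_T$ continuous at $r = R$ implies $\mathbf{L}\cdot\mathbf{E}$
continuous, $L^2\psi_M$ must be continuous; therefore, $\psi_M$ is continuous at
the spherical boundary. The tangential component of the second term
is $\hat{\mathbf{r}}\times(\nabla\times\mathbf{L})$, so we must have

$$\hat{\mathbf{r}}\times(\nabla\times\mathbf{L})\psi_E \quad\text{continuous}, \qquad (5.12.10)$$
or
$$\mathbf{L}(1 + \mathbf{r}\cdot\nabla)\psi_E \quad\text{continuous},$$
and hence,
$$(1 + \mathbf{r}\cdot\nabla)\psi_E = \left(1 + r\frac{\partial}{\partial r}\right)\psi_E \quad\text{continuous}. \qquad (5.12.11)$$
From
$$\mathbf{B} = \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_M + \frac{\nabla^2\mathbf{L}}{\omega^2}\,\psi_E \qquad (5.12.12)$$
or
$$\mathbf{B} = \frac{\nabla\times\mathbf{L}}{i\omega}\,\psi_M - \epsilon\mathbf{L}\psi_E \qquad (5.12.13)$$
we learn that
$$\epsilon\psi_E = \text{continuous} \qquad (5.12.14)$$
and
$$\left(1 + r\frac{\partial}{\partial r}\right)\psi_M = \text{continuous}. \qquad (5.12.15)$$

It is easy to verify (as we already know from Problem 3.9) that the
boundary conditions we have just found from the tangential continuity of
$\mathbf{E}$ and $\mathbf{B}$ also guarantee the normal continuity of $\mathbf{B}$ and $\mathbf{D} = \epsilon\mathbf{E}$. Therefore,
$\psi_M$ and $\psi_E$ are completely determined: They satisfy the wave equation
(5.12.8) and the boundary conditions (5.12.10), (5.12.11), (5.12.14), and
(5.12.15).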
$\psi_M$ and $\psi_E$ have $Y_{\ell m}$ expansions

$$\psi_M = \sum_{\ell m}C^M_{\ell m}\,Y_{\ell m}(\theta,\varphi)\,\frac{v^M_\ell(r)}{r} \qquad (5.12.16)$$
and
$$\psi_E = \sum_{\ell m}C^E_{\ell m}\,Y_{\ell m}(\theta,\varphi)\,\frac{v^E_\ell(r)}{r} \qquad (5.12.17)$$

where the functions $v^M_\ell$ and $v^E_\ell$ are independent of $m$.

The coefficients $C^M_{\ell m}$ and $C^E_{\ell m}$ must be determined by the incoming
wave boundary condition at large $r$.

We easily solve for $v_\ell$ (electric or magnetic). For $r < R$,

$$v_\ell = u_\ell(k_1r) \qquad (5.12.18)$$

where $k_1 = \sqrt{\epsilon}\,\omega = n\omega$. For $r > R$,
$$v_\ell = A_\ell u_\ell + B_\ell q_\ell \qquad (5.12.19)$$

where $A_\ell$ and $B_\ell$ are determined by the boundary conditions, and $u_\ell$ and
$q_\ell$ are defined in (5.5.17) and (5.6.12).

Note that $q_\ell$ may be chosen to be real, since the equations and
boundary conditions are real; then $A_\ell$ and $B_\ell$ are real, and define a phase
shift via

$$\frac{B_\ell}{A_\ell} = \tan\delta_\ell.$$

Matching the boundary conditions for $v^M_\ell$, we have

$$\left(1 + r\frac{\partial}{\partial r}\right)\frac{v^M_\ell}{r} \quad\text{continuous and}\quad \frac{v^M_\ell}{r} \quad\text{continuous}.$$

Thus,

$$A_\ell\,u_\ell(nkR) = \cos\delta_\ell\,u_\ell(kR) + \sin\delta_\ell\,q_\ell(kR) \qquad (5.12.20)$$

and

$$A_\ell\,u'_\ell(nkR) = \cos\delta_\ell\,u'_\ell(kR) + \sin\delta_\ell\,q'_\ell(kR). \qquad (5.12.21)$$

The prime in (5.12.21) and (5.12.22) stands for differentiation with respect
to $R$, not $kR$. The parameter to go into Eq. (5.6.15) is, for the magnetic amplitude, the logarithmic derivative

$$\frac{u'_\ell(nkR)}{u_\ell(nkR)}, \qquad (5.12.22)$$

which for small $k$ goes like $(\ell + 1)/R$. Thus, from (5.6.16) the normal
term in $\delta^M_\ell$ of order $k^{2\ell+1}$ vanishes; the first nonvanishing term is of order
$k^{2\ell+3}$, clearly a consequence of the $\omega^2$ dependence of the interaction
strength. This does not happen for the electric amplitude, as we now
show.

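The boundary conditions for $\psi_E$ are

$$\epsilon\psi_E \quad\text{continuous} \qquad (5.12.23)$$
and
$$\left(1 + r\frac{\partial}{\partial r}\right)\psi_E \quad\text{continuous}. \qquad (5.12.24)$$
With
$$\psi_E = \frac{v^E_\ell}{r}, \qquad (5.12.25)$$
these become
$$\epsilon v^E_\ell \quad\text{continuous} \qquad (5.12.26)$$
and
$$\frac{\partial v^E_\ell}{\partial r} \quad\text{continuous}. \qquad (5.12.27)$$

In this case, the logarithmic-derivative parameter becomes for small $k$

$$\frac{\ell + 1}{\epsilon R} \qquad (5.12.28)$$

and, from (5.6.16),

$$\delta^E_\ell \simeq \frac{k^{2\ell+1}R^{2\ell+1}}{(2\ell+1)!!\,(2\ell-1)!!}\;\frac{(\ell+1) - \dfrac{\ell+1}{\epsilon}}{\dfrac{\ell+1}{\epsilon} + \ell} \qquad (5.12.29)$$

which for $\ell = 1$ is

$$\delta^E_1 \simeq \frac{2}{3}\,k^3R^3\,\frac{\epsilon - 1}{\epsilon + 2}. \qquad (5.12.30)$$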

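To relate this calculation to scattering, we must determine the incoming
fields $\psi^0_M$ and $\psi^0_E$ in terms of

$$\mathbf{E}_0 = \hat{\mathbf{e}}^0e^{i\mathbf{k}\cdot\mathbf{r}} \quad\text{and}\quad \mathbf{B}_0 = \hat{\mathbf{k}}\times\hat{\mathbf{e}}^0e^{i\mathbf{k}\cdot\mathbf{r}}. \qquad (5.12.31)$$

We use the expansion (5.5.14) to expand $e^{i\mathbf{k}\cdot\mathbf{r}}$ in spherical harmonics:

$$e^{i\mathbf{k}\cdot\mathbf{r}} = \sum_\ell(2\ell+1)\,P_\ell(\hat{\mathbf{k}}\cdot\hat{\mathbf{r}})\,j_\ell(kr)\,i^\ell, \qquad (5.12.32)$$

and
$$\mathbf{E}_0 = \frac{\hat{\mathbf{e}}^0}{kr}\sum_\ell(2\ell+1)\,P_\ell(\hat{\mathbf{k}}\cdot\hat{\mathbf{r}})\,u_\ell(kr)\,i^\ell, \qquad u_\ell(kr) = kr\,j_\ell(kr). \qquad (5.12.33)$$

We solve for $\psi^0_M$ and $\psi^0_E$ as usual

$$\psi^0_M = \frac{1}{L^2}\,\mathbf{L}\cdot\hat{\mathbf{e}}^0e^{i\mathbf{k}\cdot\mathbf{r}} = \frac{1}{kr}\sum_\ell(2\ell+1)\,u_\ell(kr)\,i^\ell\,\frac{\hat{\mathbf{e}}^0\cdot\mathbf{L}}{\ell(\ell+1)}\,P_\ell(\hat{\mathbf{k}}\cdot\hat{\mathbf{r}}) \qquad (5.12.34)$$
and
$$\psi^0_E = -\frac{i\omega}{\nabla^2L^2}\,\mathbf{L}\times\nabla\cdot\hat{\mathbf{e}}^0e^{i\mathbf{k}\cdot\mathbf{r}} = \frac{1}{L^2}\,(\hat{\mathbf{e}}^0\times\hat{\mathbf{k}})\cdot\mathbf{L}\,e^{i\mathbf{k}\cdot\mathbf{r}} \qquad (5.12.35)$$
$$\phantom{\psi^0_E} = \frac{1}{kr}\sum_\ell(2\ell+1)\,i^\ell\,u_\ell(kr)\,\frac{1}{\ell(\ell+1)}\,(\hat{\mathbf{e}}^0\times\hat{\mathbf{k}})\cdot\mathbf{L}\,P_\ell(\hat{\mathbf{k}}\cdot\hat{\mathbf{r}}). \qquad (5.12.36)$$

We have to deal with two spherical harmonics of order $\ell$:

$$Y^M_\ell = \frac{\hat{\mathbf{e}}^0\cdot\mathbf{L}}{\ell(\ell+1)}\,P_\ell(\hat{\mathbf{k}}\cdot\hat{\mathbf{r}}) \qquad (5.12.37)$$
and
$$Y^E_\ell = \frac{(\hat{\mathbf{e}}^0\times\hat{\mathbf{k}})\cdot\mathbf{L}}{\ell(\ell+1)}\,P_\ell(\hat{\mathbf{k}}\cdot\hat{\mathbf{r}}). \qquad (5.12.38)$$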
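Note that both $\hat{\mathbf{e}}^0 = \mathbf{a}_1$ and $\hat{\mathbf{e}}^0\times\hat{\mathbf{k}} = \mathbf{a}_2$ are transverse vectors to the wave
vector $\mathbf{k}$. For either of them,

$$Y_i = \frac{\mathbf{a}_i\cdot\mathbf{L}}{\ell(\ell+1)}\,P_\ell(\hat{\mathbf{k}}\cdot\hat{\mathbf{r}}) = \frac{\mathbf{a}_i\cdot(\mathbf{r}\times\nabla)}{i\,\ell(\ell+1)}\,P_\ell(\hat{\mathbf{k}}\cdot\hat{\mathbf{r}}) = \frac{\mathbf{a}_i\cdot(\hat{\mathbf{r}}\times\hat{\mathbf{k}})}{i\,\ell(\ell+1)}\,P'_\ell(\hat{\mathbf{k}}\cdot\hat{\mathbf{r}}). \qquad (5.12.39)$$

Now choose $\hat{\mathbf{k}}$ in the z-direction. Since $\hat{\mathbf{r}} = \hat{\mathbf{k}}\cos\theta + \hat{\mathbf{i}}\sin\theta\cos\varphi + \hat{\mathbf{j}}\sin\theta\sin\varphi$,
$\hat{\mathbf{r}}\times\hat{\mathbf{k}}$ is

$$\hat{\mathbf{r}}\times\hat{\mathbf{k}} = -\hat{\mathbf{j}}\sin\theta\cos\varphi + \hat{\mathbf{i}}\sin\theta\sin\varphi \qquad (5.12.40)$$

and

$$Y_i = \frac{1}{i\,\ell(\ell+1)}\left(a_{ix}\sin\varphi - a_{iy}\cos\varphi\right)\sin\theta\,P'_\ell(\cos\theta), \qquad (5.12.41)$$

which is a spherical harmonic of order $\ell$, linear in $e^{\pm i\varphi}$, and hence a
combination of $Y_{\ell,1}(\theta,\varphi)$ and $Y_{\ell,-1}(\theta,\varphi)$. In expanding the scattering
amplitude $\mathbf{f}(\theta,\varphi)$, we must accordingly introduce the spherical functions
$Y^M_\ell$ and $Y^E_\ell$ defined in (5.12.37) and (5.12.38).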
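The scalar expansion (5.12.32) that underlies these definitions can be spot-checked numerically. The following is a minimal sketch using only the standard library; the power-series evaluation of $j_\ell$, the truncation at `lmax = 30`, and the sample values of $kr$ and $\cos\theta$ are assumptions chosen for illustration (the series is adequate for moderate $kr$, not for large arguments).

```python
import cmath

def spherical_jn(l, x, terms=60):
    """j_l(x) from its power series x^l sum_k (-x^2/2)^k / (k! (2l+2k+1)!!)."""
    term = float(x) ** l
    for odd in range(1, 2 * l + 2, 2):  # divide by (2l+1)!!
        term /= odd
    total = 0.0
    for k in range(terms):
        total += term
        term *= -0.5 * x * x / ((k + 1) * (2 * l + 2 * k + 3))
    return total

def legendre(l, c):
    """P_l(c) by the three-term recurrence."""
    if l == 0:
        return 1.0
    p_prev, p = 1.0, c
    for n in range(1, l):
        p_prev, p = p, ((2 * n + 1) * c * p - n * p_prev) / (n + 1)
    return p

def plane_wave_series(kr, cos_theta, lmax=30):
    """Partial-wave sum of i^l (2l+1) j_l(kr) P_l(cos theta)."""
    return sum((1j) ** l * (2 * l + 1) * spherical_jn(l, kr) * legendre(l, cos_theta)
               for l in range(lmax + 1))

kr, c = 2.0, 0.3
print(plane_wave_series(kr, c), cmath.exp(1j * kr * c))
```

Only $\ell \lesssim kr$ contribute appreciably, which is why a partial-wave treatment is most economical at long wavelengths.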

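As in the scalar case studied earlier, we require that the $\psi_\ell$ amplitudes
consist of the known incoming wave $\psi^0_\ell$, plus an outgoing wave with an
unknown coefficient. That coefficient is then fixed by the requirement that
the resulting wave function for each $(\ell, m)$ be a multiple of the known
radial solution $v_\ell$.

The incoming magnetic field $\psi^0_M$ is

$$\psi^0_M = \frac{1}{kr}\sum_\ell i^\ell(2\ell+1)\,u_\ell\,Y^M_\ell(\theta,\varphi), \qquad (5.12.42)$$
and the incoming electric field $\psi^0_E$ is

$$\psi^0_E = \frac{1}{kr}\sum_\ell i^\ell(2\ell+1)\,u_\ell\,Y^E_\ell(\theta,\varphi). \qquad (5.12.43)$$
The scattered field is, in each case,

$$\psi^{sc} = \frac{e^{ikr}}{r}\sum_\ell(2\ell+1)\,Y_\ell(\theta,\varphi)\,f_\ell \qquad (5.12.44)$$

where $f_\ell$ is determined by the requirement that as $r\to\infty$,
$$\psi^0 + \psi^{sc} \to \frac{1}{kr}\sum_\ell(2\ell+1)\,Y_\ell\,C_\ell\,\sin\left(kr - \frac{\ell\pi}{2} + \delta_\ell\right) \qquad (5.12.45)$$

for some $C_\ell$. Equation (5.12.45), in turn, requires

$$f_\ell = \frac{e^{2i\delta_\ell} - 1}{2ik} \qquad (5.12.46)$$

for the magnetic or electric amplitude.
The scattered electric field is now

$$\mathbf{E}^{sc} = \mathbf{L}\psi^{sc}_M + \frac{\nabla}{i\omega}\times\mathbf{L}\psi^{sc}_E \qquad (5.12.47)$$
or, asymptotically,
$$\mathbf{E}^{sc} \to \mathbf{L}\psi^{sc}_M + \hat{\mathbf{r}}\times\mathbf{L}\psi^{sc}_E,$$

and the scattering cross section is the square of the vector coefficient of
$e^{ikr}/r$ in $\mathbf{E}^{sc}$ (for $\hat{\mathbf{e}}^{0*}\cdot\hat{\mathbf{e}}^0 = 1$), as defined in (5.7.5); for each partial wave,
that coefficient is

$$\mathbf{f}_\ell = (2\ell+1)\left(\mathbf{L}Y^M_\ell\,f^M_\ell + \hat{\mathbf{r}}\times\mathbf{L}Y^E_\ell\,f^E_\ell\right). \qquad (5.12.48)$$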


We calculate separately the noninterfering total magnetic and electric 
cross section for each ¢: 


o% = (2¢ + Ire | aay ay 


= (€ + 1)(2€ + iste fan Y/YZ. (5.12.49) 
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But 
AO A 
u_ Ê L P, 
e(€ +1) 
and 
AO E aod R 
[anveevy = fanr, Laas 
[e(é + 1)] 
i alte dis a (5.12.50) 
2(2€ + 1)(€(€ + 1)) 
so that 
naM 
m _4n(2U + 1)|e™ - 1? 
= | 5.12.51 
s 2k? 2i ( ) 
a? is a little harder: 
of = (2€ +1) ier f ane LYG x LYE) (5.12.52) 
= (2€ + Disp fao YE*TXL-îX Lv? + of?) (5.12.53) 
r 
=(2€ +1)| fF)? eL + 1) fao Yer ye. 
But 
3 Rxk)- L 
Y; > feel 2 Pe, 
so again, 
4n(2€ + 1) erie — 1)2 
E- 5.12.54 
cs 2K? 2i ( ) 
Substituting the result for 8# for £ = 1, we find for small k, 
8r [Re -DY 
of T e (ZAD) i (5.12.55) 
3 e+2 


We recover, for small $\epsilon - 1$, the result given by (5.9.24), which as $k \to 0$ 
gives 
-1) 4 
reg E i e (5.12.56) 
4T 3 
and 
-1 2 2 
ar= o($4) Red (5.12.57) 


The result (5.12.55) shows that the electrostatic polarizability 
$\alpha = [(\epsilon - 1)/(\epsilon + 2)]\, R^3$ of a dielectric sphere dominates the 
low-frequency scattering cross section. 


CHAPTER 5 PROBLEMS 


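5.1. From the reality of the interaction applied to the scattering of a 
scalar wave, we found that the scattering amplitude to go from wave 
number $\mathbf{k}_1$ to $\mathbf{k}_2$ (with $|\mathbf{k}_1| = |\mathbf{k}_2| = k$), $f(\mathbf{k}_2, \mathbf{k}_1)$, was given by 

$$f(\mathbf{k}_2, \mathbf{k}_1) = \sum_\ell (2\ell + 1)\, P_\ell(\hat{\mathbf{k}}_1 \cdot \hat{\mathbf{k}}_2)\, f_\ell(k) \tag{1}$$

where 

$$f_\ell(k) = \frac{e^{2i\delta_\ell} - 1}{2ik} = \frac{e^{i\delta_\ell} \sin\delta_\ell}{k} \tag{2}$$

and $\delta_\ell$ real. $f_\ell$ therefore satisfies a partial wave optical theorem: 
$\mathrm{Im}\, f_\ell = k |f_\ell|^2$. 

From (1) and (2), prove the generalized optical theorem for 
$|\mathbf{k}_2| = |\mathbf{k}_1| = k$: 

$$\mathrm{Im}\, f(\mathbf{k}_2, \mathbf{k}_1) = \frac{k}{4\pi} \int d\Omega_{\mathbf{k}}\, f^*(\mathbf{k}_2, \mathbf{k})\, f(\mathbf{k}_1, \mathbf{k}).$$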

5.2. For a scalar field scattered by a complex potential, the wave function 
cannot be chosen real, and the asymptotic form for a given $\ell$ will 
not approach $q_\ell \to \sin[kr - (\ell\pi/2) + \delta_\ell]$ as $r \to \infty$ with $\delta_\ell$ real. The 
asymptotic wave function $q_\ell$ will approach $q_\ell \to \alpha \sin[kr - (\ell\pi/2)]$ 
$+ \beta \cos[kr - (\ell\pi/2)]$, where $\alpha$ and $\beta$ are complex functions of $k$. 

(a) Show that one can always uniquely (to within $\pm n\pi$) find a $\delta_\ell$ 
such that $q_\ell \to A \sin[kr - (\ell\pi/2) + \delta_\ell]$ with complex $\delta_\ell$, and that 
the scattering amplitude for that case is $f_\ell = (e^{2i\delta_\ell} - 1)/2ik$. 

(b) Suppose energy is absorbed by the system. Apply the energy 
calculation used to derive the optical theorem to a single partial 
wave to decide what the sign of $\mathrm{Im}\, \delta_\ell$ must be if the target 
absorbs energy. 

(c) Give an example of a (nonlocal) potential that will scatter only 
an $\ell$ wave. 

*5.3. Consider the scattering of light by a perfectly conducting sphere of 
radius $R$. The complex electric field outside the sphere satisfies the 
wave equation 

$$(\nabla^2 + \omega^2)\mathbf{E} = 0 \tag{3}$$

and the divergence condition 

$$\nabla \cdot \mathbf{E} = 0. \tag{4}$$


(a) From (3) and (4) and the multipole expansion 

$$\mathbf{E} = \nabla\psi^L + \mathbf{L}\psi^M + \frac{\nabla}{i\omega} \times \mathbf{L}\psi^E \tag{5}$$

verify that outside the sphere, for $\ell \neq 0$, 

$$\nabla^2\psi^L = 0, \quad (\nabla^2 + \omega^2)\psi^M = 0, \quad \text{and} \quad \nabla^2(\nabla^2 + \omega^2)\psi^E = 0. \tag{6}$$

(b) With 

$$\psi^E = \psi^E_1 + \psi^E_2 \tag{7}$$

and 

$$(\omega^2 + \nabla^2)\psi^E_1 = 0, \quad \nabla^2\psi^E_2 = 0, \tag{8}$$

verify from (3-8) that we must have 

$$\nabla\psi^L + \frac{\nabla}{i\omega} \times \mathbf{L}\psi^E_2 = 0$$

and that this equation can be satisfied outside the conductor for 
both $\psi$'s $\neq 0$. 

(c) Write the boundary condition that must be satisfied by $\psi^E_1$ and 
$\psi^M$ at the surface of the sphere. From these, calculate the electric 
and magnetic phase shifts $\delta^E_\ell$ and $\delta^M_\ell$ and the low-frequency 
limits of the partial wave cross sections $\sigma^E_\ell$ and $\sigma^M_\ell$. 

(d) In part (c), you found that $\delta^E_\ell$ and $\delta^M_\ell \to k^{2\ell+1}$ for small $k$. How 
can this be? We found in Section 5.12 that, for any $\epsilon$, 
$\delta^M_\ell \to k^{2\ell+3}$ as $k \to 0$; but we can include conductivity in $\epsilon$ by 
letting $\epsilon \to \epsilon + (i\sigma/\omega)$, and a perfect conductor by taking the 
limit $\sigma \to \infty$, or $\epsilon \to \infty$. What went wrong? 

(e) From (5.6.15) and (5.12.22), calculate the first nonvanishing 
power of $k$ and its coefficient in the magnetic phase shift 
$\delta^M_\ell(k)$. Repeat the calculation starting from the perturbative 
result (5.9.24) and show that the results agree for small $\epsilon - 1$. 

(f) Calculate the $\ell = 1$ electric phase shift from (5.9.24) and show 
that it agrees with (5.12.30) for small $\epsilon - 1$ and $k \to 0$. 


5.4. Calculate the $\ell = 0$ scattering length for a scalar field scattered by 
a square well potential 

$$U(r) = U_0, \quad r < R; \qquad U = 0, \quad r > R.$$

The equation for $q_\ell$ is the Schrödinger equation (for $\ell = 0$): 

$$-\frac{d^2 q}{dr^2} + U(r)\, q = k^2 q.$$

The boundary conditions are $q(r = 0) = 0$, and $q$ and $dq/dr$ continuous 
at $r = R$. Consider separately the three cases (a) $U_0 > 0$, (b) 
$U_0 < 0$ and $\sqrt{-U_0}\, R < 1$, and (c) $U_0 < 0$ and $\sqrt{-U_0}\, R$ close to $\pi/2$. 


5.5. The Schrödinger equation (5.3.2) has the integral form (with a slight 
redefinition of $U$) 


1 goler] 


o(r) = em — 


4x J |r—r'| 


U(r') dr’ (r’). 


It is sometimes useful to Fourier-transform the integral equation. 
To do this, show first that 
(a) 

ikyr 


Caden 4m dk e™" 


im i 
r «0+ (2m) J k? — ki — ie 


(b) Then show that the integral equation for y(k)= 
far e *Td(r) is 


4r 


= 3 534, _ = 
x(k) = (27) 8° (k - ko) = 5 


f æwua, k’)x(k') 


where U(k, k’) = U/(2m)? | e™™"U (r) e* "dr. 


(c) Show that the scattering amplitude f is 
1 
f= > x! dk'U (ky, k’) x(k’), 
4r 


where $\mathbf{k}_f = k_0 \hat{\mathbf{r}}$, with $\hat{\mathbf{r}}$ the direction of observation, and therefore 

$$f = \frac{1}{4\pi} (k^2 - k_0^2) \left( \chi(\mathbf{k}) - (2\pi)^3 \delta^3(\mathbf{k} - \mathbf{k}_0) \right).$$

(d) Check that for small $U$ (to be defined) and $k_0 \to 0$, the results 
of (c) above agree with what you found in Problem 5.4. 

(e) Repeat (a-c) above for $U$, a nonlocal potential, defined by 

$$U\phi(\mathbf{r}) = \int U(\mathbf{r}, \mathbf{r}')\, \phi(\mathbf{r}')\, d\mathbf{r}'.$$

Show that the only change is that $U(\mathbf{k}, \mathbf{k}')$ becomes 

$$U(\mathbf{k}, \mathbf{k}') = \frac{1}{(2\pi)^3} \int e^{-i\mathbf{k} \cdot \mathbf{r}}\, d\mathbf{r}\; U(\mathbf{r}, \mathbf{r}')\, e^{i\mathbf{k}' \cdot \mathbf{r}'}\, d\mathbf{r}'.$$

5.6. A nonlocal potential for which the scattering equation can be solved 
exactly (i.e., in terms of integrals) is a product potential: 

$$U(\mathbf{r}, \mathbf{r}') = g(\mathbf{r})\, g^*(\mathbf{r}').$$

Find the scattering amplitude in terms of the Fourier transform of 
$g$: 

$$h(\mathbf{k}) = \int e^{-i\mathbf{k} \cdot \mathbf{r}}\, g(\mathbf{r})\, d\mathbf{r}.$$

5.7. Apply the factorization techniques of Section 5.5 to the Schrödinger 
equation for the one-dimensional harmonic oscillator: 

$$H\psi = \left( -\frac{1}{2} \frac{d^2}{dx^2} + \frac{x^2}{2} \right) \psi = E\psi, \qquad -\infty < x < \infty.$$

The factors are $a_+ = -(d/dx) + x$ and $a_- = (d/dx) + x$. 

(a) Show that $a_+ a_- = 2H - 1$ and $a_- a_+ = 2H + 1$. 

(b) Show that if $H\psi = E\psi$, then $H a_+\psi = (E + 1)\, a_+\psi$ so that $a_+$ is 
a raising operator, and that $H a_-\psi = (E - 1)\, a_-\psi$ so that $a_-$ is a 
lowering operator. 

(c) The positivity of $H$ shows that there must be an eigenfunction 
of $H$, $\psi_0$, that cannot be lowered. Use this to find the lowest 
eigenvalue ($E = \frac{1}{2}$) and corresponding normalized eigenfunction 
($\psi_0 = \pi^{-1/4} e^{-x^2/2}$). 

(d) The $n$th eigenfunction, with energy $E_n = n + \frac{1}{2}$, is given by 
$\psi_n = A (a_+)^n \psi_0$. 
Calculate the value of $A$ required to normalize $\psi_n$. 

*5.8. Apply the factorization techniques of Section 5.5 to the Schrödinger 
equation for the isotropic three-dimensional harmonic oscillator, 
with angular momentum quantum number $\ell$: 


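$$H_\ell \psi_\ell = \left( -\frac{1}{2} \frac{d^2}{dr^2} + \frac{\ell(\ell+1)}{2r^2} + \frac{1}{2} r^2 \right) \psi_\ell = E_\ell \psi_\ell, \qquad 0 \le r < \infty \quad \text{and} \quad \psi_\ell(0) = 0.$$

The factors are $A^+_\ell = -(d/dr) + (\ell/r) + r$ and $A^-_\ell = (d/dr) 
+ (\ell/r) + r$. 

(a) Show that $A^+_\ell$ raises $\ell$ and $E_\ell$ by one unit each, and that $A^-_\ell$ 
lowers $\ell$ and $E_\ell$ by one unit each. 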

(b) Finding the spectrum here is more subtle. The lowest value of 
€ is zero, for which H, becomes H of the preceding problem, 
with the difference that in this case the wave function must 
vanish at r=0. Thus, only odd harmonic oscillator eigen- 
functions are allowed. 

Using all this, give the eigenvalues E,(n,) as explicit 
functions of £ and n, (the harmonic oscillator quantum number). 
Give the allowed values of € and n,. 

(c) Show how the $\ell = 0$ wave functions fail to be lowered by $A^-_\ell$. 

(d) Using the separability of the Schrödinger equation, 


1/d? d? ee 1 
Glan aa)? ge ae 


— 


find the eigenvalues E as sums and the eigenfunction tp, n,n, 
as products of the eigenvalues and eigenfunctions of the three 
one dimensional oscillators. 

Write explicitly all the eigenfunctions for which n, + n, +n, 
= 2 and show how these appear when characterized by € and 
n,. 


5.9. Show that the next term in the expansion of h,(kr) at large kr is 


given by 


er £(€+ 1)i 
hake) = eei( t 2kr sea) 
CHAPTER 6 


Invariance and Special Relativity 


6.1. INVARIANCE 


When we say the laws of nature are invariant under a transformation, 
we mean that it is impossible to determine whether that transformation 
has taken place. To illustrate, consider invariance under time translation. 
The physical meaning of this invariance is that were we to go to sleep and 
wake up some time later, no experiments we perform before and after 
our nap (not counting looking at a clock) could tell us how long we had 
been asleep. That is, the laws of Nature do not change with time; they 
are the same now as they were then. 

The mathematical expression of this invariance is that the equations 
governing the system we are describing are invariant under the 
transformation $t' = t + \Delta$. For example, Newton’s law of gravitation, 

$$m_i \frac{d^2 \mathbf{r}_i}{dt^2} = -\sum_{j \neq i} G\, \frac{(\mathbf{r}_i - \mathbf{r}_j)\, m_i m_j}{|\mathbf{r}_i - \mathbf{r}_j|^3},$$

can be expressed in terms of $t'$: 

$$m_i \frac{d^2 \mathbf{r}_i}{dt'^2} = -\sum_{j \neq i} G\, \frac{(\mathbf{r}_i - \mathbf{r}_j)\, m_i m_j}{|\mathbf{r}_i - \mathbf{r}_j|^3}.$$

Since the two laws are the same, the phenomena they describe 
(gravitational motions) cannot tell us whether we are using $t$ or $t'$ as a 
clock. (See Appendix A.2 for further elementary discussion.) 

Similarly, translational invariance makes it impossible to tell whether 
our laboratory has been picked up and moved to a new location. 

Rotational invariance makes it impossible to tell whether our laboratory 
has been turned around. 

Thus, translation and rotation invariance together tell us that space is 
homogeneous and isotropic. 

Invariance with respect to a constant velocity transformation makes 
it impossible to tell if our laboratory has been gently accelerated to a new 
velocity. All these invariance principles are believed to hold exactly.¹ 

There is one other class of invariances believed to be exactly true: 
these are gauge invariances analogous to the gauge invariance of electrody- 
namics. 

There are two more space-time connected invariance principles that 
are approximately true: inversion invariance (called P), which forbids 
knowing whether you are looking in a mirror or at the real world; P is 
broken by the weak $\beta$ decay interactions. An invariance that holds much 
more accurately, called CP, forbids knowing whether you are looking at 
the reflection of a particle in a mirror, or at an antiparticle in the real 
world. CP is known to be violated, but very weakly. 

The second approximate space-time invariance, which is also weakly 
broken, is time reversal invariance, called T. This invariance tells us that 
every motion has a reversed motion that is equally possible, with the 
same coordinates and accelerations, but opposite velocities, occurring in 
backward order in time. The classical laws of mechanics and electromag- 
netism are invariant under P and T. 

In relativistic quantum field theory, it is shown that CPT must hold 
exactly. That is, T must be violated just enough to compensate in CPT 
for the violation of CP. CPT says: You cannot tell whether you are 
looking at a certain motion of particles in the real world, or looking in a 
mirror at antiparticles undergoing the time-reversed motion. 

We may express the space-time invariances in terms of infinitesimal 
transformation of coordinates. Thus, for translation in space or time: 


$$t' = t + \Delta t, \quad x' = x + \Delta x, \quad y' = y + \Delta y, \quad z' = z + \Delta z. \tag{6.1.1}$$


We have three rotations: 

$$x' = x + y\, \Delta\theta, \quad y' = y - x\, \Delta\theta, \quad z' = z, \tag{6.1.2}$$

and two more infinitesimal rotations about $x$ and $y$. 

¹Of course, real space in our universe is filled with masses that generate gravitational 
fields. It is our common experience on Earth that space above our planet is neither isotropic 
nor homogeneous. What does appear to be true is that for phenomena occurring on a 
distance and time scale that is small compared to the length and timescale of the gravitational 
field, there exists a class of observers for whom these space-time invariances hold. This 
follows from the equivalence principle, which tells us that an observer in free fall in a 
gravitational field will have no local way of determining that there is a gravitational field in 
the neighborhood. Of course, the falling elevator must not be so large that the convergence 
of the field lines toward their source can be detected. 

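One can express finite translations and rotations by iterating 
infinitesimal ones. For translation, this is trivial. For rotations, note that 
(6.1.2) is equivalent to 

$$\begin{pmatrix} x' \\ y' \end{pmatrix} = (1 + i\, \Delta\theta\, \sigma_y) \begin{pmatrix} x \\ y \end{pmatrix} \tag{6.1.3}$$

where $\sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}$. Repeated $N$ times, with $N \Delta\theta = \theta$, the finite rotation 
is 

$$\begin{pmatrix} x' \\ y' \end{pmatrix} = \left( 1 + \frac{i\theta\sigma_y}{N} \right)^N \begin{pmatrix} x \\ y \end{pmatrix} \tag{6.1.4}$$

or, as $N \to \infty$, with $\theta$ remaining finite, 

$$\begin{pmatrix} x' \\ y' \end{pmatrix} = e^{i\theta\sigma_y} \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} x \cos\theta + y \sin\theta \\ -x \sin\theta + y \cos\theta \end{pmatrix}. \tag{6.1.5}$$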

An inversion cannot be generated by a succession of rotations, since 
the determinant of the rotation matrix $a_{ij}$ in $x'_i = a_{ij} x_j$ (summation 
convention) is 1, whereas an inversion $x'_i = -x_i$ has determinant $-1$. 

Finally, we come to the transformation between observers moving 
with constant but different velocities. Newtonian mechanics has an 
invariance of this kind, called Galilean invariance, with the transformation 
given by 

$$x'_i = x_i - u_i t. \tag{6.1.6}$$

The Newtonian equations 

$$m_a \frac{d^2 x^a_i}{dt^2} = \sum_b F^{ab}_i (x^a_j - x^b_j) \tag{6.1.7}$$

are clearly invariant under the transformation (6.1.6) for a force law $F_i$ 
that is independent of velocity. Thus, the Newtonian world would have 
velocity invariance (called Galilean relativity) were distance measurements 
to transform according to (6.1.6), that is, were the distance between 
simultaneous events the same for all observers. 

Another consequence of (6.1.6) is the addition rule for velocities: 

$$\frac{dx'_i}{dt} = \frac{dx_i}{dt} - u_i. \tag{6.1.8}$$

Thus, if observer $O$ observes a velocity $v^a_i$, an observer $O'$ will observe 
the velocity 

$$v'^a_i = v^a_i - u_i. \tag{6.1.9}$$


Evidently, Maxwell’s equations cannot be invariant to the transforma- 
tion (6.1.6), since they predict a unique light velocity c, independent of 
the velocity of the observer or the source. 

Thus, if there is a principle of relativity (invariance under constant 
velocity transformation), then either the transformation (6.1.6) is wrong, 
and Newton’s laws must be modified; or Maxwell’s equations are wrong, 
and the modified equations must somehow contain the possibility of the 
velocity addition law without containing the observer's velocity as a para- 
meter. 

No such modification has been found, although it must have been 
diligently sought. Einstein took the first view: There is an invariance 
principle, Maxwell’s equations are correct, and they are invariant to the 
correct transformation law (to be found) replacing (6.1.6). 


6.2. THE LORENTZ TRANSFORMATION 


We are thus led to study (first in one dimension) the kinds of transforma- 
tion that could hold between space-time measurements made by different 
observers moving with constant velocity with respect to each other. It will 
turn out that we are almost uniquely led to a Lorentz transformation, with 
a velocity parameter $c$. Galilean relativity results from $c \to \infty$. Einstein’s 
relativity results from $c$ = velocity of light. 

The most general transformation that leaves space homogeneous must 
be linear: 


$$x' = \gamma(v)(x - vt). \tag{6.2.1}$$


Thus, the trajectory $x = vt$ corresponds to $x' = 0$. That is, the origin of 
the $O'$ coordinate system moves with velocity $v$ to the right in the $O$ 
system. 

Let us assume that the $O$ system moves (to the left) with velocity 
$-v$ with respect to the $O'$ system. Then, of course, assuming $t' = t$, we 
must have 

$$x = \gamma(-v)(x' + vt) \tag{6.2.2}$$

which together with (6.2.1) implies $\gamma = 1$; back to Galileo. 

It was the remarkable insight of Einstein here that relations between 
space-time measurements made by different observers cannot be derived 
by logic, but are empirical. There is therefore no reason to insist that time 
intervals between events be invariant. Einstein wrote, instead, 

$$x' = \gamma(v)(x - vt) \quad \text{and} \quad t' = \delta(v)\left(t - \beta(v)\, x\right). \tag{6.2.3}$$

The transformation from $-x$ to $-x'$ should have a velocity $-v$. Then 
$-x' = \gamma(v)(-x - vt)$ should be equivalent to $x' = \gamma(-v)(x + vt)$, or 
$\gamma(v) = \gamma(-v)$. Similarly, 

$$t' = \delta(v)\left(t + \beta(v)\, x\right)$$

should be equivalent to 

$$t' = \delta(-v)\left(t - \beta(-v)\, x\right)$$

so that $\delta(v) = \delta(-v)$ and $\beta(-v) = -\beta(v)$. 
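Now solve for $x, t$ as functions of $x', t'$. From (6.2.3), 

$$t = \frac{t'}{\delta} + \beta x \tag{6.2.4}$$

so that 

$$x' = \gamma(v)\left( x - v\left( \frac{t'}{\delta} + \beta x \right) \right) = \gamma(v)\left( x(1 - \beta v) - \frac{v t'}{\delta} \right)$$

or 

$$x \gamma (1 - \beta v) = x' + \frac{\gamma v}{\delta}\, t' \tag{6.2.5}$$

and 

$$t (1 - \beta v) = \frac{t'}{\delta} + \frac{\beta x'}{\gamma}. \tag{6.2.6}$$

Since we have here 

$$x = \gamma(x' + vt'),$$

comparison with (6.2.5) gives $\delta = \gamma$, and it follows that 

$$\gamma^2 (1 - \beta v) = 1. \tag{6.2.7}$$

Define 

$$\beta = \frac{v}{c^2(v)} \tag{6.2.8}$$

where $c(v)$ has the dimension of velocity. 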

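We now show that $c(v)$ must be a constant by requiring that two 
successive transformations of the form (6.2.1) and (6.2.3) must again be 
of the same form. That is, with $c = c(v)$ and $c' = c(v')$, we set 

$$x'' = \gamma(v')(x' - v't') \tag{6.2.9}$$

and 

$$t'' = \gamma(v')\left( t' - \frac{v'x'}{c'^2} \right) \tag{6.2.10}$$

or 

$$x'' = \gamma(v')\gamma(v)\left[ x\left( 1 + \frac{vv'}{c^2} \right) - t(v + v') \right] \tag{6.2.11}$$

and 

$$t'' = \gamma(v')\gamma(v)\left[ t\left( 1 + \frac{vv'}{c'^2} \right) - x\left( \frac{v}{c^2} + \frac{v'}{c'^2} \right) \right] \tag{6.2.12}$$

so that $c'$ must equal $c$; thus, $c$ is a constant. The rule for adding velocities 
is 

$$u = \frac{v + v'}{1 + \dfrac{vv'}{c^2}} \tag{6.2.13}$$

and 

$$\gamma(v) = \frac{1}{\sqrt{1 - v^2/c^2}}. \tag{6.2.14}$$

We note that the addition law for $v$ and $c$ is 

$$u = \frac{v + c}{1 + \dfrac{vc}{c^2}} = c \tag{6.2.15}$$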


so that the speed c is a limit, seen to be the same for all observers. Thus, 
the identification of c with the velocity of light will guarantee the constancy 
of that velocity for all observers, consistent with the straightforward inter- 
pretation of Maxwell’s equations. 

Clearly, the simplest three-dimensional expression of this transforma- 
tion is 


$$x' = \gamma(x - vt), \quad t' = \gamma\left( t - \frac{v}{c^2}\, x \right), \quad y' = y, \quad z' = z. \tag{6.2.16}$$


A few remarks on (6.2.16) are in order. First, these equations must 
relate space-time intervals. Evidently, the origins of the $O$ and $O'$ 
space-time coordinate systems could be different, in which case we would have 

$$x' = \gamma(x - vt) + x_0, \qquad t' = \gamma\left( t - \frac{vx}{c^2} \right) + t_0$$

with $x_0$ and $t_0$ the same for all events instead of (6.2.16). Equation (6.2.16) 
holds when the origins of the two coordinate systems pass each other at 
a time that both observers call zero. If there is any confusion about this 
point, rewrite (6.2.16) as 

$$\Delta x' = \gamma(\Delta x - v \Delta t), \qquad \Delta t' = \gamma\left( \Delta t - \frac{v \Delta x}{c^2} \right) \tag{6.2.17}$$

where $\Delta x$ and $\Delta t$ are the space and time intervals between two events; 
$\Delta x'$ and $\Delta t'$ are those intervals as measured by the primed observer. 

Second, let us consider the behavior of clocks. Suppose we have a 
clock at rest in $O$, and the time between ticks (say, birth and death of a 
$\mu$ meson) is $T$. Then the time between these events seen by $O'$ is, since 
they occur at the same position in the $O$ frame, 

$$T' = \gamma T \quad \text{(i.e., a longer time)}. \tag{6.2.18}$$


Third, moving rods appear shorter. Suppose a rod is at rest in $O'$ 
with length $L$, that is, $x'_1 - x'_2 = L$. Then, a measurement of the distance 
$x_1 - x_2$ at the same $O$ time would give 

$$L = \gamma(x_1 - x_2) \quad \text{or} \quad (x_1 - x_2) = \frac{L}{\gamma} = L \sqrt{1 - \frac{v^2}{c^2}}, \tag{6.2.19}$$

the Lorentz-Fitzgerald contraction. 
The simplest way to characterize the Lorentz transformation (6.2.17) 
is by the invariance of the quantity 

$$(\Delta\tau)^2 = (\Delta t)^2 - \frac{(\Delta x)^2}{c^2} \tag{6.2.20}$$

or, in three dimensions, 

$$(\Delta\tau)^2 = (\Delta t)^2 - \frac{(\Delta\mathbf{x})^2}{c^2}. \tag{6.2.21}$$


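Thus, the transformation between space-time measurements is (remember 
the summation convention) 

$$x'^\mu = a^\mu + \Lambda^\mu{}_\nu x^\nu \tag{6.2.22}$$

where $\mu = 0, 1, 2, 3$, $x^0 = ct$, and the invariant interval² between two 
events is 

$$(\Delta\tau)^2 = \Delta x^\mu \eta_{\mu\nu} \Delta x^\nu = \Delta x_\nu \Delta x^\nu \tag{6.2.23}$$

with 

$$\eta_{00} = 1, \quad \eta_{ij} = -\delta_{ij}, \quad \eta_{0i} = \eta_{i0} = 0, \quad i = 1, 2, 3.$$

In our one-dimensional example, with $x = x^1$, etc., 

$$\Lambda^1{}_1 = \gamma, \quad \Lambda^1{}_0 = -\frac{\gamma v}{c}, \quad \Lambda^0{}_0 = \gamma, \quad \Lambda^0{}_1 = -\frac{\gamma v}{c}. \tag{6.2.24}$$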


The transformations (6.2.22) form the Poincaré group. The subgroup 
with $a^\mu = 0$ is the Lorentz group. The Lorentz group has six parameters: 
three velocity transformations (boosts) and three rotations. The Poincaré 
group has four more: the four space-time translations. 

²The reader is warned that there is no general agreement on the sign of $\eta$. Our choice 
makes $(\Delta\tau)^2 > 0$ for a timelike interval, that is, one with $|\Delta t| > |\Delta\mathbf{x}|/c$. 

From (6.2.23) we learn that 

$$\Delta x'^\mu \eta_{\mu\nu} \Delta x'^\nu = \Lambda^\mu{}_\sigma \Delta x^\sigma\, \eta_{\mu\nu}\, \Lambda^\nu{}_\lambda \Delta x^\lambda = \Delta x^\sigma \eta_{\sigma\lambda} \Delta x^\lambda \tag{6.2.25}$$

or 

$$\Lambda^\mu{}_\sigma\, \eta_{\mu\nu}\, \Lambda^\nu{}_\lambda = \eta_{\sigma\lambda}. \tag{6.2.26}$$

So, if we define 

$$\Lambda^\mu{}_\sigma = \Lambda_{\mu\sigma} \ \text{(a matrix)} \quad \text{and} \quad \Lambda^\mu{}_\sigma = (\Lambda^T)_{\sigma\mu}, \tag{6.2.27}$$

Eq. (6.2.26) requires that 

$$\Lambda^T \eta \Lambda = \eta \tag{6.2.28}$$

as a matrix identity defining the most general Lorentz transformation. 
Just as in the case of rotations, we find from (6.2.28), 

$$\det \eta = \det(\Lambda^T \eta \Lambda) = (\det \Lambda)^2 \det \eta$$

so $\det \Lambda = \pm 1$. 
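The matrix condition (6.2.28) and the two possible determinants can be verified for explicit transformations. The following sketch is not from the original text; the helper names are illustrative assumptions, coordinates are ordered $(t, x, y, z)$, and units with $c = 1$ are assumed. It tests a boost composed with a rotation and, separately, a space inversion.

```python
import math

# Metric with eta_00 = 1, eta_ij = -delta_ij, coordinates ordered (t, x, y, z).
ETA = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def transpose(a):
    return [list(row) for row in zip(*a)]


def det(m):
    """Determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = 0.0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def boost_x(v):
    """Boost along x with velocity v (c = 1), as in (6.2.24)."""
    g = 1.0 / math.sqrt(1.0 - v * v)
    return [[g, -g * v, 0, 0], [-g * v, g, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]


def rotation_z(theta):
    """Rotation about z with the sign convention of (6.1.5)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[1, 0, 0, 0], [0, c, s, 0], [0, -s, c, 0], [0, 0, 0, 1]]


def preserves_metric(lam, tol=1e-12):
    """Check Lambda^T eta Lambda = eta entry by entry."""
    lhs = matmul(transpose(lam), matmul(ETA, lam))
    return all(abs(lhs[i][j] - ETA[i][j]) < tol
               for i in range(4) for j in range(4))


if __name__ == "__main__":
    lam = matmul(boost_x(0.6), rotation_z(0.3))
    inversion = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
    print(preserves_metric(lam), det(lam))
    print(preserves_metric(inversion), det(inversion))
```

Both matrices satisfy (6.2.28); only the determinant distinguishes the boost and rotation from the inversion.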
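Transformations with determinant $+1$ can be reached by a succession 
of infinitesimal Lorentz transformations.³ Thus, analogous to (6.1.2), we 
can consider 

$$x' = x - \Delta v\, t, \qquad t' = t - \Delta v\, x$$

(where from now on we use units in which $c = 1$). With $\xi = N \Delta v$, 

$$\begin{pmatrix} x' \\ t' \end{pmatrix} = \left( 1 - \frac{\xi}{N}\, \sigma \right)^N \begin{pmatrix} x \\ t \end{pmatrix}, \qquad \sigma = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}.$$

As $N \to \infty$, 

$$\begin{pmatrix} x' \\ t' \end{pmatrix} = e^{-\xi\sigma} \begin{pmatrix} x \\ t \end{pmatrix} = (\cosh\xi - \sigma \sinh\xi) \begin{pmatrix} x \\ t \end{pmatrix}$$

so that 

$$\begin{pmatrix} x' \\ t' \end{pmatrix} = \begin{pmatrix} x \cosh\xi - t \sinh\xi \\ -x \sinh\xi + t \cosh\xi \end{pmatrix}. \tag{6.2.29}$$

³This does not include $x'^\mu = -x^\mu$. 

The transformation velocity is clearly 

$$v = \tanh\xi. \tag{6.2.30}$$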
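A short numerical sketch may help connect (6.2.29) and (6.2.30) with the velocity addition rule (6.2.13). It is not part of the original text, and the function names are illustrative assumptions. It shows that the parameter $\xi$ simply adds when boosts along the same axis are composed, and that iterating the infinitesimal transformation approaches the hyperbolic form.

```python
import math


def mat_mul(a, b):
    """Multiply two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def boost_matrix(xi):
    """Finite boost (6.2.29) acting on the column (x, t), with c = 1."""
    ch, sh = math.cosh(xi), math.sinh(xi)
    return [[ch, -sh], [-sh, ch]]


def iterated_boost(xi, n_steps):
    """Product of n_steps infinitesimal boosts with dv = xi / n_steps."""
    dv = xi / n_steps
    step = [[1.0, -dv], [-dv, 1.0]]
    result = [[1.0, 0.0], [0.0, 1.0]]
    for _ in range(n_steps):
        result = mat_mul(result, step)
    return result


def max_abs_diff(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))


if __name__ == "__main__":
    xi1, xi2 = 0.3, 0.5
    v1, v2 = math.tanh(xi1), math.tanh(xi2)
    composed = mat_mul(boost_matrix(xi1), boost_matrix(xi2))
    print("composition vs. sum of parameters:",
          max_abs_diff(composed, boost_matrix(xi1 + xi2)))
    print("tanh(xi1 + xi2):     ", math.tanh(xi1 + xi2))
    print("(v1 + v2)/(1 + v1v2):", (v1 + v2) / (1.0 + v1 * v2))
    print("iterated vs. finite: ",
          max_abs_diff(iterated_boost(0.8, 20000), boost_matrix(0.8)))
```

The additivity of $\xi$ is the hyperbolic analogue of the additivity of rotation angles about a fixed axis.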


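The most general infinitesimal Lorentz transformation is specified 
by a $4 \times 4$ antisymmetric matrix that we call $\epsilon_{\sigma\lambda}$. Thus, we set 
$\Lambda^\mu{}_\nu = \delta^\mu_\nu + \epsilon^\mu{}_\nu$ and with $\epsilon_{\sigma\nu} = \eta_{\sigma\mu} \epsilon^\mu{}_\nu$, (6.2.26) shows that 

$$\epsilon_{\sigma\lambda} + \epsilon_{\lambda\sigma} = 0. \tag{6.2.31}$$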


Returning to the invariant $(d\tau)^2$, we remark that the notation is 
deceptive, since $(d\tau)^2$ can be negative or positive. For two simultaneous 
events, $(\Delta t)^2 = 0$, $(\Delta\tau)^2 = -(\Delta x)^2$; the interval is called spacelike. For two 
events at the same spatial location, $(\Delta x)^2 = 0$ and $(\Delta\tau)^2 = (\Delta t)^2$; the 
interval is called timelike. In general, if the interval is spacelike, there is 
a coordinate system within which the events are simultaneous. In 
one dimension, $\Delta t' = (\Delta t - v \Delta x)/\sqrt{1 - v^2}$, so setting $v = \Delta t/\Delta x$, with 
$|\Delta t/\Delta x| < 1$ for a spacelike interval, we find $\Delta t' = 0$. Clearly, slightly 
smaller or greater velocities than $v$ can make $\Delta t'$ either positive or 
negative; time order for a spacelike interval is not invariant. 

Similarly, for a timelike interval, there is always an observer for whom 
the two events happen at the same place. Also, for a timelike interval, 
the sign of the time difference is clearly invariant. 

We finally consider the consequence of an object moving with a 
superluminal velocity, that is, a velocity greater than 1. Suppose the object goes 
from $x_1, t_1$ to $x_2, t_2$, where $x_2 - x_1 = v(t_2 - t_1)$, with $v > 1$. Then the (1, 2) 
interval is spacelike, and the time order of $t_1$ and $t_2$ will be different for 
different observers. That is, it will be impossible to say whether the object 
went from 1 to 2 or from 2 to 1. Thus, no causal relationship between 
events 1 and 2 can be established. 
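The statements about time order can be made concrete with a few numbers. This sketch is not part of the original text; the function names and the sample intervals are illustrative assumptions, with $c = 1$. For a spacelike interval the sign of $\Delta t'$ depends on the observer's velocity, while for a timelike interval it does not.

```python
import math


def boosted_dt(dt, dx, v):
    """Time separation seen by an observer moving with velocity v (c = 1)."""
    return (dt - v * dx) / math.sqrt(1.0 - v * v)


def interval_type(dt, dx):
    """Classify the interval by the sign of (dt)^2 - (dx)^2."""
    s = dt * dt - dx * dx
    if s > 0:
        return "timelike"
    if s < 0:
        return "spacelike"
    return "lightlike"


if __name__ == "__main__":
    # Spacelike example: the critical velocity is dt/dx = 0.5.
    dt, dx = 1.0, 2.0
    print(interval_type(dt, dx))
    for v in (0.4, 0.5, 0.6):
        print("  v =", v, " dt' =", boosted_dt(dt, dx, v))

    # Timelike example: the sign of dt' is the same for every |v| < 1.
    dt, dx = 2.0, 1.0
    print(interval_type(dt, dx))
    for v in (-0.9, 0.0, 0.9):
        print("  v =", v, " dt' =", boosted_dt(dt, dx, v))
```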

How can we write equations that are covariant with respect to the 
Lorentz transformation—that is, equations which are the same for all 
Lorentz observers, but which do not contain explicitly the velocity of the 
observer with respect to a given coordinate system? 

We have a clue in our treatment of rotational invariance. There, we 
required that all equations be tensor equations, the tensors being defined 
by a specific set of transformations: rotations. Thus, the basic tensor 
in classical mechanics is a vector $x_i$, or better, $dx_i$, the (infinitesimal) 
displacement between two points. Since $dt$ is a rotational invariant, $dx_i/dt$, 
$d^2x_i/dt^2$, etc. are all vectors, so linear equations joining $d^2x_i/dt^2$ to other 
vectors, for example, $(x_i - y_i)/|\mathbf{x} - \mathbf{y}|^3$, will be vector equations and hence 
invariant under rotations. 

We can do the same for Lorentz transformations. We define a 
contravariant vector under Lorentz transformations as an object $V^\mu$ such that 

$$V'^\mu = \Lambda^\mu{}_\nu V^\nu.$$

An immediate example of such a vector is the coordinate differential, 

$$dx^\mu = (dx^0, dx^1, dx^2, dx^3),$$

since $dx'^\mu = \Lambda^\mu{}_\nu\, dx^\nu$ precisely describes the way coordinate differentials 
(space-time intervals between events) transform. 

A more compact and convenient way of characterizing the 
transformation properties of $dx^\mu$ is 

$$dx'^\mu = \frac{\partial x'^\mu}{\partial x^\nu}\, dx^\nu \tag{6.2.33}$$

or, for a general contravariant vector, 

$$V'^\mu = \frac{\partial x'^\mu}{\partial x^\nu}\, V^\nu.$$

How do we construct a finite Lorentz (or four-) vector? We clearly 
need an analogue to the use of $dt$ described above for rotational 
invariance. We recall the invariant 

$$(d\tau)^2 = (dt)^2 - (d\mathbf{x})^2 = dx^\mu \eta_{\mu\nu}\, dx^\nu.$$

If the two events are separated by a timelike interval (and for two 
locations of a particle moving with $v < 1$, they will be), $(d\tau)^2$ is positive, 
and $d\tau = dt \sqrt{1 - v^2}$ will be invariant; thus, 

$$u^\mu = \frac{dx^\mu}{d\tau} = \left( \frac{1}{\sqrt{1 - v^2}},\ \frac{\mathbf{v}}{\sqrt{1 - v^2}} \right) \tag{6.2.34}$$

will be a four-vector, sometimes called the four-velocity. 

We can now write a covariant equation for momentum conservation. 
Let us call the four-vector $p^\mu$ (where $m u^\mu = p^\mu$) the four-momentum of 
the particle with mass $m$. Then a covariant conservation law for a 
two-body interaction, with $a$ and $b$ incoming, possibly different $c$ and $d$ 
outgoing, would be 

$$P^\mu = p^\mu_a + p^\mu_b = p^\mu_c + p^\mu_d. \tag{6.2.35}$$


The covariance of the momentum conservation law requires a fourth 
conservation law: energy. The energy, of course, includes rest energy: 
The four-vector 

$$p^\mu = (p^0, \mathbf{p}) \tag{6.2.36}$$

has 

$$\mathbf{p} = \frac{m\mathbf{v}}{\sqrt{1 - v^2}} \tag{6.2.37}$$

and 

$$p^0 = \frac{m}{\sqrt{1 - v^2}}. \tag{6.2.38}$$

For $v \ll 1$, $\mathbf{p} \cong m\mathbf{v}$ and $p^0 \cong m + mv^2/2$, the nonrelativistic forms, but 
with the rest energy ($mc^2$) added. 

The individual momenta are timelike with positive time components, 
so their sum $P^\mu$ is timelike. Therefore, there is a coordinate system in 
which the space component $P^i$ is zero; it is called the rest system and is 
usually the most convenient place to carry out calculations. We consider 
the two-body example. First, we express the energy in terms of the 
momentum of each particle: 

$$\mathbf{p} = \frac{m\mathbf{v}}{\sqrt{1 - v^2}}, \qquad \mathbf{v} = \frac{\mathbf{p}}{\sqrt{p^2 + m^2}} \tag{6.2.39}$$

and 

$$p^0 = \sqrt{p^2 + m^2}. \tag{6.2.40}$$

Therefore, in the rest system of $a$ and $b$, $\mathbf{p}_a = -\mathbf{p}_b = \mathbf{p}$ and 

$$P^0 = p^0_a + p^0_b = \sqrt{m_a^2 + p^2} + \sqrt{m_b^2 + p^2}. \tag{6.2.41}$$

Since momentum is conserved, $\mathbf{p}_c = -\mathbf{p}_d = \mathbf{p}'$ and energy is conserved, 

$$P^0 = p^0_c + p^0_d = \sqrt{m_c^2 + p'^2} + \sqrt{m_d^2 + p'^2}. \tag{6.2.42}$$

The system will be exothermic or endothermic according to whether 
$m_a + m_b$ is greater than or less than $m_c + m_d$. 
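As an illustration of (6.2.41) and (6.2.42), the final momentum $p'$ in the rest system can be computed from the total energy. The sketch below is not from the original text: the function names and sample masses are assumptions, units with $c = 1$ are used, and the closed form for $p'$ is the standard two-body result obtained by solving (6.2.42) algebraically, which the text itself does not write out.

```python
import math


def energy(m, p):
    """Energy of a particle of mass m and momentum magnitude p, as in (6.2.40)."""
    return math.sqrt(m * m + p * p)


def total_energy(m_a, m_b, p):
    """Total energy P^0 in the rest system, as in (6.2.41)."""
    return energy(m_a, p) + energy(m_b, p)


def final_momentum(p0_total, m_c, m_d):
    """Solve (6.2.42) for p' given the total energy and the outgoing masses.

    Uses p'^2 = [s - (m_c + m_d)^2][s - (m_c - m_d)^2] / (4 s), s = (P^0)^2.
    """
    if p0_total < m_c + m_d:
        raise ValueError("total energy is below the threshold m_c + m_d")
    s = p0_total ** 2
    return math.sqrt((s - (m_c + m_d) ** 2) * (s - (m_c - m_d) ** 2)) / (2.0 * p0_total)


if __name__ == "__main__":
    m_a, m_b, p = 1.0, 2.0, 1.5
    p0 = total_energy(m_a, m_b, p)
    # Elastic case: same masses out as in, so p' should equal p.
    print("elastic p':   ", final_momentum(p0, m_a, m_b))
    # Exothermic case: lighter products carry more momentum.
    m_c, m_d = 0.5, 1.0
    p_out = final_momentum(p0, m_c, m_d)
    print("exothermic p':", p_out)
    print("energy check: ", p0, total_energy(m_c, m_d, p_out))
```

In the exothermic example, $m_a + m_b > m_c + m_d$, and the rest energy released appears as additional kinetic energy of the products.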
6.3. LORENTZ TENSORS 


Since, for any contravariant vector, $V^\mu \eta_{\mu\nu} V^\nu$ is invariant, so is 

$$(V^\mu + U^\mu)\, \eta_{\mu\nu}\, (V^\nu + U^\nu)$$

and hence so is $U^\mu \eta_{\mu\nu} V^\nu$. Thus, given a contravariant vector $U^\nu$, there 
exists an object 

$$U_\mu = \eta_{\mu\nu} U^\nu \tag{6.3.1}$$

such that $U_\mu V^\mu$ is invariant for any vectors $U^\mu$ and $V^\mu$. 
$U_\mu$ is called a covariant vector. From (6.3.1), with $U^\mu = (U^0, U^i)$, we 
have⁴ 

$$U_\mu = (U^0, -U^i) = (U_0, U_i). \tag{6.3.2}$$


Tensors under rotation have the property that their covariant and 
contravariant representations are the same. However, we note that if we had 
used a nonorthogonal basis system, say, $\mathbf{e}_\lambda$, as the basis vectors in 
three-dimensional space, then any vector $\mathbf{V}$ could be expanded as 

$$\mathbf{V} = \sum_\lambda \mathbf{e}_\lambda V^\lambda \tag{6.3.3}$$

and would be invariant under rotation. In particular, for the vector 

$$d\mathbf{x} = \sum_\lambda \mathbf{e}_\lambda\, dx^\lambda, \tag{6.3.4}$$

the $dx^\lambda$'s would transform contravariantly, and since $d\mathbf{x}$ is invariant, the 
$\mathbf{e}_\lambda$'s transform covariantly. 
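The transformation law for covariant Lorentz vectors is 

$$U'_\mu = \frac{\partial x^\nu}{\partial x'^\mu}\, U_\nu \tag{6.3.5}$$

since with 

$$V'^\mu = \frac{\partial x'^\mu}{\partial x^\sigma}\, V^\sigma,$$

we have 

$$U'_\mu V'^\mu = \frac{\partial x^\nu}{\partial x'^\mu} \frac{\partial x'^\mu}{\partial x^\sigma}\, U_\nu V^\sigma = U_\nu \delta^\nu_\sigma V^\sigma = U_\nu V^\nu \tag{6.3.6}$$

where $\delta^\nu_\sigma$ is the usual Kronecker delta. 

⁴We usually represent four-vectors and tensors with Greek indices, $\mu, \nu, \ldots$, and 
three-dimensional space vectors with latin indices, $i, j, k, \ldots$. 

Conversely, if $U_\mu V^\mu$ is invariant and $V^\mu$ is an arbitrary contravariant 
vector, then $U_\mu$ is a covariant vector, since 

$$U'_\mu V'^\mu = U'_\mu \frac{\partial x'^\mu}{\partial x^\nu}\, V^\nu = U_\nu V^\nu$$

from which 

$$U_\nu = \frac{\partial x'^\mu}{\partial x^\nu}\, U'_\mu.$$
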

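Lorentz tensors are constructed in the same way as rotational tensors, 
except that they can be covariant, contravariant, or mixed. Thus, we have 
a second-rank contravariant tensor 

$$T'^{\mu\nu} = \frac{\partial x'^\mu}{\partial x^\sigma} \frac{\partial x'^\nu}{\partial x^\lambda}\, T^{\sigma\lambda}, \tag{6.3.7}$$

a second-rank covariant tensor 

$$T'_{\mu\nu} = \frac{\partial x^\sigma}{\partial x'^\mu} \frac{\partial x^\lambda}{\partial x'^\nu}\, T_{\sigma\lambda}, \tag{6.3.8}$$

and a second-rank mixed tensor 

$$T'^\mu{}_\nu = \frac{\partial x'^\mu}{\partial x^\sigma} \frac{\partial x^\lambda}{\partial x'^\nu}\, T^\sigma{}_\lambda. \tag{6.3.9}$$

Symmetry properties are invariant: For example, let $T_{\mu\nu} = \pm T_{\nu\mu}$. Then 

$$T'_{\mu\nu} = \frac{\partial x^\sigma}{\partial x'^\mu} \frac{\partial x^\lambda}{\partial x'^\nu}\, T_{\sigma\lambda} \quad \text{and} \quad T'_{\nu\mu} = \frac{\partial x^\sigma}{\partial x'^\nu} \frac{\partial x^\lambda}{\partial x'^\mu}\, T_{\sigma\lambda};$$

interchanging the dummy indices $\sigma$ and $\lambda$ gives 

$$T'_{\nu\mu} = \frac{\partial x^\lambda}{\partial x'^\nu} \frac{\partial x^\sigma}{\partial x'^\mu}\, T_{\lambda\sigma} = \pm T'_{\mu\nu}.$$

There are several universal tensors. $\delta^\mu_\nu$ is a second-rank mixed tensor: 

$$\delta'^\mu{}_\nu = \frac{\partial x'^\mu}{\partial x^\sigma} \frac{\partial x^\lambda}{\partial x'^\nu}\, \delta^\sigma_\lambda = \frac{\partial x'^\mu}{\partial x^\sigma} \frac{\partial x^\sigma}{\partial x'^\nu} = \delta^\mu_\nu. \tag{6.3.10}$$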


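$\eta_{\mu\nu}$ is a second-rank covariant tensor: With $x^\mu$ and $y^\nu$ as contravariant 
vectors, we see, by the definition of the Lorentz transformation, that 

$$x^\mu \eta_{\mu\nu} y^\nu = x'^\mu \eta'_{\mu\nu} y'^\nu$$

is invariant, where $\eta'_{\mu\nu}$ is the same function of its indices as $\eta_{\mu\nu}$. Since 

$$x'^\mu = \frac{\partial x'^\mu}{\partial x^\sigma}\, x^\sigma, \qquad y'^\nu = \frac{\partial x'^\nu}{\partial x^\lambda}\, y^\lambda,$$

we have 

$$x^\sigma \eta_{\sigma\lambda} y^\lambda = x^\sigma\, \frac{\partial x'^\mu}{\partial x^\sigma}\, \eta'_{\mu\nu}\, \frac{\partial x'^\nu}{\partial x^\lambda}\, y^\lambda$$

so that 

$$\eta_{\sigma\lambda} = \frac{\partial x'^\mu}{\partial x^\sigma} \frac{\partial x'^\nu}{\partial x^\lambda}\, \eta'_{\mu\nu}. \tag{6.3.11}$$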


Of course, (6.3.11) is equivalent to the original equations that determined 
the Lorentz transformation matrix. The result could also have been 
deduced from the fact that $\eta_{\mu\nu} V^\nu$ transforms as a covariant vector for any 
$V^\nu$, since if $T_{\mu\nu} V^\nu$ transforms like a covariant vector for any contravariant 
$V^\nu$, then $T_{\mu\nu}$ is a tensor. The proof is identical to the one given above for 
a covariant vector. Clearly, these rules apply to tensors of arbitrary rank: 
Any covariant index can, with summation, cancel a contravariant index. 
Thus, with $A$ and $B$ tensors, $A_{\mu\sigma\lambda} B^{\lambda\nu} = T_{\mu\sigma}{}^\nu$ is a mixed tensor, as 
indicated. Single indices can be lowered with $\eta_{\mu\nu}$. However, note that 
$\eta_{\lambda\mu} \eta_{\mu\nu} = \delta^\lambda_\nu$ and is therefore a mixed tensor; it follows that $\eta_{\lambda\mu} = \eta^{\lambda\mu}$ is 
a second-rank contravariant tensor as well as a second-rank covariant 
tensor. Thus, $\eta^{\mu\nu} = \eta_{\mu\nu}$ can also be used to raise indices and to lower 
the rank of a tensor by tracing: 

$$\mathrm{Tr}\, T_{\alpha\beta} = \eta^{\alpha\beta} T_{\alpha\beta}$$

or with 

$$T^\alpha{}_\beta = \eta^{\alpha\lambda} T_{\lambda\beta}, \qquad \mathrm{Tr}\, T = T^\alpha{}_\alpha. \tag{6.3.12}$$


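One more universal tensor, $\epsilon_{\mu\nu\lambda\sigma}$, is defined to be totally antisymmetric 
in $\mu\nu\lambda\sigma$; hence, all four indices must be different, so $\epsilon_{\mu\nu\lambda\sigma} = \pm\epsilon_{0123}$, the 
plus sign for $\mu\nu\lambda\sigma$ an even permutation of 0123, the minus sign for an 
odd permutation. We define $\epsilon_{0123} = 1$. (Beware of applying the 
three-dimensional rule that cyclic permutations are even. They are not. In four 
dimensions they are odd.) 

Is $\epsilon$ a tensor? We calculate 

$$\epsilon'_{\mu\nu\lambda\sigma} = \frac{\partial x^\alpha}{\partial x'^\mu} \frac{\partial x^\beta}{\partial x'^\nu} \frac{\partial x^\gamma}{\partial x'^\lambda} \frac{\partial x^\delta}{\partial x'^\sigma}\, \epsilon_{\alpha\beta\gamma\delta}.$$

Since $\epsilon'$ has the total antisymmetry of $\epsilon$, we need only calculate 

$$\epsilon'_{0123} = \frac{\partial x^\alpha}{\partial x'^0} \frac{\partial x^\beta}{\partial x'^1} \frac{\partial x^\gamma}{\partial x'^2} \frac{\partial x^\delta}{\partial x'^3}\, \epsilon_{\alpha\beta\gamma\delta} = \det\left( \frac{\partial x}{\partial x'} \right). \tag{6.3.13}$$

So, $\epsilon_{\mu\nu\lambda\sigma}$ transforms like a tensor for proper Lorentz transformations, but 
like a pseudotensor for inversions. 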
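The warning about cyclic permutations is easy to confirm by counting inversions. The following sketch is not part of the original text; the function names are illustrative assumptions. It computes the sign of a permutation and evaluates the totally antisymmetric symbol with the normalization $\epsilon_{0123} = 1$.

```python
from itertools import permutations


def permutation_sign(perm):
    """Return +1 for an even permutation and -1 for an odd one."""
    inversions = sum(1
                     for i in range(len(perm))
                     for j in range(i + 1, len(perm))
                     if perm[i] > perm[j])
    return -1 if inversions % 2 else 1


def levi_civita(mu, nu, lam, sigma):
    """Totally antisymmetric symbol with levi_civita(0, 1, 2, 3) = 1."""
    indices = (mu, nu, lam, sigma)
    if len(set(indices)) < 4:
        return 0  # any repeated index gives zero
    return permutation_sign(indices)


if __name__ == "__main__":
    print("three dimensions, cyclic (1, 2, 0):", permutation_sign((1, 2, 0)))
    print("four dimensions, cyclic (1, 2, 3, 0):", levi_civita(1, 2, 3, 0))
    print("sum over all index orderings:",
          sum(levi_civita(*p) for p in permutations(range(4))))
```

A cyclic shift of three indices is two transpositions, while a cyclic shift of four indices is three, which is why the rule changes between three and four dimensions.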


6.4. TENSOR FIELDS: COVARIANT ELECTRODYNAMICS 


Tensor fields are functions of $\mathbf{x}$ and $t$ that transform according to tensor 
laws. Thus, a tensor of rank zero is a scalar, and a scalar field is an 
invariant, say, $\psi(x)$.⁵ 

The invariance of the field means that two Lorentz observers 
observing the field will measure the same value of $\psi$ at the same space-time 
point, so $\psi'(x'(x)) = \psi(x)$. That is, 

$$\psi'(x') = \psi(x) \tag{6.4.1}$$

where $x'$ and $x$ represent the same space-time point. 

⁵From now on, we will frequently use symbols like $x$ and $y$ to stand for a four-vector, 
$x^\mu$ and $y^\mu$. Thus, a scalar field $\psi(x) = \psi(x^0, \mathbf{x})$. 
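The prototype covariant vector field is the gradient of a scalar field: 

$$V_\mu = \frac{\partial\psi}{\partial x^\mu} \tag{6.4.2}$$

since 

$$\frac{\partial\psi'(x')}{\partial x'^\mu} = \frac{\partial\psi(x)}{\partial x'^\mu} = \frac{\partial x^\nu}{\partial x'^\mu} \frac{\partial\psi(x)}{\partial x^\nu}. \tag{6.4.3}$$

Derivatives of tensor fields generate higher tensors, for example, 

$$\frac{\partial V^\mu}{\partial x^\nu} = T^\mu{}_\nu. \tag{6.4.4}$$

Derivatives can also be used to lower rank by tracing: 

$$\frac{\partial}{\partial x^\mu}\, V^\mu = V, \tag{6.4.5}$$

a scalar. 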
The first example of a physical scalar field is the scalar density 
associated with a point particle, moving along a given trajectory 

$$\mathbf{x}_p = \mathbf{x}_p(t) \tag{6.4.6}$$

or more covariantly, 

$$x^\mu_p = x^\mu_p(\tau) \tag{6.4.7}$$

where $d\tau$ is the proper time along the trajectory: 

$$d\tau = dx^0 \sqrt{1 - v^2}. \tag{6.4.8}$$

We obtain (6.4.6) from (6.4.7) by solving $x^0 = x^0_p(\tau)$ for $\tau$ as a function 
of $x^0$ and substituting it into $x^i_p(\tau)$. 
We wish to construct a scalar density® that, for a point particle, must 


°We here give the word “density” its physics meaning. The expressions scalar (and 
vector and tensor) density are sometimes used to designate certain transformation properties 
under general coordinate transformations [as in (7.6.15)]. That is not the case here. 
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be proportional to a three-dimensional delta function 
d(x°, x) = A8?(x ~ x,(x°)) (6.4.9) 


where A may depend on the velocity of the particle. 

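To show that an $A$ exists, we note that the four-dimensional volume
element $d^4x = dx^0\,d^3x$ is invariant, since the Jacobian from $x$ to $x'$ is just
the absolute value of the determinant

$$\det\left(\frac{\partial x'^\mu}{\partial x^\nu}\right),$$

which is equal to 1. Therefore, the four-dimensional delta function

$$\delta^4(x - x_p(\tau)) = \delta(x^0 - x_p^0(\tau))\,\delta^3(\mathbf{x} - \mathbf{x}_p(\tau)) \tag{6.4.10}$$

is invariant, as is its integral over the invariant $\tau$ [which is held to a unique
value $\tau(x^0)$ by the first delta function in (6.4.10)]. There results

$$d(x) = \int d\tau\,\delta(x^0 - x_p^0(\tau))\,\delta^3(\mathbf{x} - \mathbf{x}_p(\tau)) = \frac{d\tau}{dx_p^0}\,\delta^3(\mathbf{x} - \mathbf{x}_p(\tau))$$

at $x_p^0(\tau) = x^0$, so

$$d(x) = \sqrt{1-v^2}\,\delta^3(\mathbf{x} - \mathbf{x}_p(x^0)) \tag{6.4.11}$$

where

$$\mathbf{v} = \frac{d\mathbf{x}_p}{dx^0}.$$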

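Similarly, a four-vector density for a point particle can be defined by

$$J^\mu = \frac{dx_p^\mu}{d\tau}\,d,$$

that is,

$$J^\mu = \int d\tau\,\frac{dx_p^\mu(\tau)}{d\tau}\,\delta^4(x - x_p(\tau)) = \frac{dx_p^\mu}{d\tau}\,\frac{d\tau}{dx^0}\,\delta^3(\mathbf{x} - \mathbf{x}_p(x^0)) \tag{6.4.12}$$

or

$$J^0 = \delta^3(\mathbf{x} - \mathbf{x}_p(x^0)) \quad\text{and}\quad \mathbf{J} = \mathbf{v}\,\delta^3(\mathbf{x} - \mathbf{x}_p(x^0)). \tag{6.4.13}$$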


We recognize in (6.4.13) the electromagnetic charge and current density
of a point particle carrying unit charge. For a particle of charge $q$, we
would have

$$j^\mu = qJ^\mu. \tag{6.4.14}$$

Finally, we construct a second-rank tensor density for a point particle:

$$T^{\mu\nu} = \frac{dx_p^\mu}{d\tau}\,\frac{dx_p^\nu}{d\tau}\,\frac{d\tau}{dx^0}\,\delta^3(\mathbf{x} - \mathbf{x}_p(x^0)). \tag{6.4.15}$$


Covariant densities can be constructed by using the tensor $\eta_{\mu\nu}$ to lower
contravariant indices. Remember that this operation is actually very
simple: Leave the 0th component alone and change the sign of the three
space components.

A useful clue to help construct a relativistic electrodynamics is furnished
by (6.4.13), which tells us that the charge and current densities of
a point particle form a contravariant four-vector; so therefore does a sum
of these objects over different particles. Therefore, a general charge and
current density form a contravariant vector field $j^\mu(x)$. The local charge
conservation law is a scalar equation

$$\frac{\partial j^\mu}{\partial x^\mu} = \frac{\partial\rho}{\partial t} + \nabla\cdot\mathbf{j} = 0, \tag{6.4.16}$$

which we know is satisfied by (6.4.13). (See Problem 2.1.)
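Recall now the equations for the vector and scalar potentials in the
Lorentz gauge:

$$\left(\nabla^2 - \frac{\partial^2}{\partial t^2}\right)\mathbf{A} = -4\pi\mathbf{j} \quad\text{and}\quad \left(\nabla^2 - \frac{\partial^2}{\partial t^2}\right)\phi = -4\pi\rho. \tag{6.4.17}$$

Since

$$\nabla^2 - \frac{\partial^2}{\partial t^2} = -\frac{\partial}{\partial x^\mu}\,\frac{\partial}{\partial x_\mu}$$

is a scalar operator, it is natural to treat $\mathbf{A}$ and $\phi$ as a contravariant
four-vector. In general, of course, there can be no required relativistic
transformation properties for $\mathbf{A}$ and $\phi$, since $\mathbf{A}$ and $\phi$ are only determined
to within a gauge transformation, and gauge transformations have no
special covariance properties. Note, however, that the Lorentz gauge allows
$\mathbf{A}$ and $\phi$ to be given vector transformation properties, since the Lorentz
condition

$$\nabla\cdot\mathbf{A} + \frac{\partial\phi}{\partial t} = 0,$$

expressed covariantly becomes

$$\frac{\partial A^\mu}{\partial x^\mu} = 0. \tag{6.4.18}$$

A gauge transformation takes

$$A_\mu \quad\text{into}\quad A_\mu + \frac{\partial\chi}{\partial x^\mu}. \tag{6.4.19}$$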


None of these considerations chooses a sign for the contravariant
vector $A^\mu$. It is actually most convenient to deal with the covariant vector
$A_\mu$, which we take to be given in terms of the vector and scalar potentials
$\mathbf{A}$ and $\phi$ by

$$A_\mu = (-\phi, \mathbf{A}). \tag{6.4.20}$$


A second clue tells us how to obtain the fields. Remember $\mathbf{B} = \nabla\times\mathbf{A}$,
or in tensor language⁷ $B_{ij} = \partial_iA_j - \partial_jA_i$, so that $B_{12} = B_3$, etc. The
obvious covariant generalization is to define a field tensor⁸:

$$F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu. \tag{6.4.21}$$

The antisymmetric tensor $F_{\mu\nu}$ is gauge invariant (under
$A_\mu \to A_\mu + \partial\chi/\partial x^\mu$) and has the right number of components to generate
the electromagnetic field: $F_{12}$, $F_{23}$, $F_{31}$ (which we have defined to generate
$B_{ij}$) and $F_{10}$, $F_{20}$, $F_{30}$ (which must generate $E_i$). Let us check:

$$F_{ij} = \partial_iA_j - \partial_jA_i = \epsilon_{ijk}B_k \tag{6.4.22}$$

and

$$F_{i0} = \partial_iA_0 - \partial_0A_i = E_i. \tag{6.4.23}$$

⁷We introduce here another convention: $\partial_\mu = \partial/\partial x^\mu$ and $\partial^\mu = \partial/\partial x_\mu$.
⁸Here again, the reader is warned that there is no general consensus on the sign of
$F_{\mu\nu}$. Our choice makes $F_{i0} = E_i$.

Finally, we must be able to write Maxwell's equations in covariant
form. First, the inhomogeneous equations involve derivatives of $F$ on the
left and current components on the right. The covariant expression of this
must be $\partial_\mu F^{\mu\nu}$ on the left and $j^\nu$ on the right. Thus,

$$\partial_\mu F^{\mu\nu} = \kappa j^\nu \tag{6.4.24}$$

with $\kappa$ to be determined. We write out (6.4.24):

$$\partial_iF^{i\nu} + \partial_0F^{0\nu} = \kappa j^\nu. \tag{6.4.25}$$

Let $\nu = 0$: $\partial_iF^{i0} = \kappa j^0$, or using (6.4.23), we obtain

$$\nabla\cdot\mathbf{E} = -\kappa j^0 \tag{6.4.26}$$

so $\kappa = -4\pi$. Now let $\nu = \ell$: We find

$$\partial_iF^{i\ell} + \partial_0F^{0\ell} = \kappa j^\ell. \tag{6.4.27}$$

Since the form of (6.4.27) guarantees rotational invariance, it is sufficient
to consider one value of $\ell$, say, $\ell = 1$. Then (6.4.27) becomes

$$\partial_iF^{i1} + \partial_0F^{01} = \kappa j^1$$

or

$$\partial_iF_{i1} + \partial_0F_{10} = \kappa j^1$$

or

$$\partial_2F_{21} + \partial_3F_{31} + \partial_0E_1 = \kappa j^1$$

or

$$(\partial_3B_2 - \partial_2B_3) + \partial_0E_1 = \kappa j^1; \tag{6.4.28}$$

with $\kappa = -4\pi$, (6.4.28) gives

$$-(\nabla\times\mathbf{B}) + \partial_0\mathbf{E} = -4\pi\mathbf{j},$$

which is correct. Therefore, the four inhomogeneous equations are put
together into one four-vector equation:

$$\partial_\mu F^{\mu\nu} = -4\pi j^\nu. \tag{6.4.29}$$


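The homogeneous equations are also four in number, suggesting a
four-vector equation. However, we know that the homogeneous equations
involve axial quantities: $\partial\mathbf{B}/\partial t$, $\nabla\times\mathbf{E}$, and $\nabla\cdot\mathbf{B}$. The unique axial vector
consisting of first derivatives of $F$ is $\partial_\nu\tilde{F}^{\mu\nu}$, where $\tilde{F}^{\mu\nu}$ is the dual of $F$:

$$\tilde{F}^{\mu\nu} = \tfrac{1}{2}\,\epsilon^{\mu\nu\lambda\sigma}F_{\lambda\sigma}. \tag{6.4.30}$$

$\epsilon^{\mu\nu\lambda\sigma}$ is, of course, $-\epsilon_{\mu\nu\lambda\sigma}$, since one time and three space indices are
raised to go from $\epsilon_{\mu\nu\lambda\sigma}$ to $\epsilon^{\mu\nu\lambda\sigma}$. The dual $\tilde{F}^{\mu\nu}$ of $F_{\mu\nu}$ simply interchanges
$\mathbf{E}$ and $\mathbf{B}$. That is,

$$\tilde{F}^{ij} = \tfrac{1}{2}\,\epsilon^{ij\lambda\sigma}F_{\lambda\sigma} = \epsilon^{ijk0}F_{k0} = \epsilon_{ijk}E_k$$

and

$$\tilde{F}^{i0} = \tfrac{1}{2}\,\epsilon^{i0jk}F_{jk} = \tfrac{1}{2}\,\epsilon_{ijk}F_{jk} = B_i.$$

Thus, we conjecture that the homogeneous equations are equivalent to

$$\partial_\nu\,\epsilon^{\mu\nu\lambda\sigma}F_{\lambda\sigma} = 0. \tag{6.4.31}$$

Written out, (6.4.31) is for $\mu = 0$:

$$\epsilon^{0ijk}\,\partial_iF_{jk} = -\epsilon_{ijk}\,\partial_iF_{jk} = 0 \quad\text{or}\quad \nabla\cdot\mathbf{B} = 0.$$

For $\mu = j$,

$$\epsilon^{j\nu\lambda\sigma}\,\partial_\nu F_{\lambda\sigma} = \epsilon^{j0k\ell}\,\partial_0F_{k\ell} + \epsilon^{jk0\ell}\,\partial_kF_{0\ell} + \epsilon^{jk\ell0}\,\partial_kF_{\ell0}$$

$$= \epsilon_{jk\ell}\,\partial_0F_{k\ell} - 2\epsilon_{jk\ell}\,\partial_kF_{0\ell}$$

$$= 2\partial_0B_j + 2\epsilon_{jk\ell}\,\partial_kE_\ell = 0$$

or

$$\partial_0\mathbf{B} + \nabla\times\mathbf{E} = 0.$$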


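A more familiar form for (6.4.31) is

$$\partial_\lambda F_{\mu\nu} + \partial_\mu F_{\nu\lambda} + \partial_\nu F_{\lambda\mu} = 0. \tag{6.4.32}$$

To see the equivalence, note that (6.4.32) is antisymmetric in any
interchange. For example, interchanging $\mu$ and $\nu$ gives

$$\partial_\lambda F_{\nu\mu} + \partial_\nu F_{\mu\lambda} + \partial_\mu F_{\lambda\nu},$$

which is the negative of the original expression. Therefore, we rewrite
(6.4.31):

$$0 = 3\,\epsilon^{\mu\nu\lambda\sigma}\,\partial_\nu F_{\lambda\sigma} = \epsilon^{\mu\nu\lambda\sigma}\,\partial_\nu F_{\lambda\sigma} + \epsilon^{\mu\nu\lambda\sigma}\,\partial_\lambda F_{\sigma\nu} + \epsilon^{\mu\nu\lambda\sigma}\,\partial_\sigma F_{\nu\lambda}$$

$$= \epsilon^{\mu\nu\lambda\sigma}\left(\partial_\nu F_{\lambda\sigma} + \partial_\lambda F_{\sigma\nu} + \partial_\sigma F_{\nu\lambda}\right),$$

for each $\mu$. The sum over $\nu\lambda\sigma$ simply multiplies the result by 6 (the
number of permutations of three objects). Therefore, the quantity in
parentheses is 0.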

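We close this section by studying the transformation laws for the $\mathbf{E}$
and $\mathbf{B}$ fields. We have, since $F_{\mu\nu}$ is a tensor,

$$F'_{\mu\nu} = \frac{\partial x^\lambda}{\partial x'^\mu}\,\frac{\partial x^\sigma}{\partial x'^\nu}\,F_{\lambda\sigma}. \tag{6.4.33}$$

Let the primed observer be moving with velocity $v$ in the $x$ direction.
Then

$$x^0 = \frac{x'^0 + vx'}{\sqrt{1-v^2}}, \qquad x = \frac{x' + vx'^0}{\sqrt{1-v^2}}, \tag{6.4.34}$$

$y = y'$ and $z = z'$, from which we can calculate $\partial x^\lambda/\partial x'^\mu$.
We first calculate the longitudinal fields $B_x = F_{yz}$ and $E_x = F_{x0}$:

$$F'_{yz} = \frac{\partial x^\lambda}{\partial y'}\,\frac{\partial x^\sigma}{\partial z'}\,F_{\lambda\sigma} = F_{yz} \tag{6.4.35}$$

and $B_x$ is invariant.

$$F'_{x0} = \frac{\partial x^\lambda}{\partial x'}\,\frac{\partial x^\sigma}{\partial x'^0}\,F_{\lambda\sigma} = \left(\frac{\partial x}{\partial x'}\,\frac{\partial x^0}{\partial x'^0} - \frac{\partial x^0}{\partial x'}\,\frac{\partial x}{\partial x'^0}\right)F_{x0}$$

$$= \left(\frac{1}{1-v^2} - \frac{v^2}{1-v^2}\right)F_{x0} = F_{x0} \tag{6.4.36}$$

and $E_x$ is also invariant.
Consider next the transverse components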


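$$B'_y = F'_{zx} = \frac{\partial x^\lambda}{\partial z'}\,\frac{\partial x^\sigma}{\partial x'}\,F_{\lambda\sigma} = \frac{\partial x}{\partial x'}\,F_{zx} + \frac{\partial x^0}{\partial x'}\,F_{z0} = \frac{1}{\sqrt{1-v^2}}\,(B_y + vE_z)$$

or

$$\mathbf{B}'_T = \frac{1}{\sqrt{1-v^2}}\,(\mathbf{B}_T - \mathbf{v}\times\mathbf{E}_T) \tag{6.4.37}$$

since

$$(\mathbf{v}\times\mathbf{E}_T)_y = \left(v\,\hat{\mathbf{e}}_x\times(\hat{\mathbf{e}}_yE_y + \hat{\mathbf{e}}_zE_z)\right)_y = -vE_z.$$

Finally,

$$E'_y = F'_{y0} = \frac{\partial x^\lambda}{\partial y'}\,\frac{\partial x^\sigma}{\partial x'^0}\,F_{\lambda\sigma} = \frac{\partial x^0}{\partial x'^0}\,F_{y0} + \frac{\partial x}{\partial x'^0}\,F_{yx} = \frac{1}{\sqrt{1-v^2}}\,(E_y - vB_z)$$

or

$$\mathbf{E}'_T = \frac{1}{\sqrt{1-v^2}}\,(\mathbf{E}_T + \mathbf{v}\times\mathbf{B}_T). \tag{6.4.38}$$

Suppose there is a magnetic field but no electric field in the $O$ system.
Then an object at rest in the $O'$ system would have velocity $\mathbf{v}$ in the $O$
system, but would only experience the electric force in the $O'$ system,
which is, for small $v$, $e\mathbf{v}\times\mathbf{B}_T = e\mathbf{v}\times\mathbf{B}$, as it should be.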
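
As a numerical cross-check of the tensor transformation law against the closed-form results for the longitudinal and transverse fields, the following minimal sketch builds $F_{\mu\nu}$ from $\mathbf{E}$ and $\mathbf{B}$ using the sign conventions of this section ($F_{i0} = E_i$, $F_{12} = B_3$, etc.), applies the boost along $x$, and compares. The function names and the sample field values are illustrative assumptions, not part of the text.

```python
import math


def field_tensor(E, B):
    """Covariant F_{mu nu} with F_{i0} = E_i and F_{ij} = eps_{ijk} B_k (index 0 is time)."""
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    return [
        [0.0, -Ex, -Ey, -Ez],
        [Ex, 0.0, Bz, -By],
        [Ey, -Bz, 0.0, Bx],
        [Ez, By, -Bx, 0.0],
    ]


def boost_jacobian(v):
    """J[lam][mu] = d x^lam / d x'^mu for a boost with velocity v along x (units with c = 1)."""
    g = 1.0 / math.sqrt(1.0 - v * v)
    return [
        [g, g * v, 0.0, 0.0],
        [g * v, g, 0.0, 0.0],
        [0.0, 0.0, 1.0, 0.0],
        [0.0, 0.0, 0.0, 1.0],
    ]


def transform_tensor(F, J):
    """F'_{mu nu} = J[lam][mu] J[sig][nu] F_{lam sig}, summed over lam and sig."""
    return [
        [sum(J[l][m] * J[s][n] * F[l][s] for l in range(4) for s in range(4))
         for n in range(4)]
        for m in range(4)
    ]


def fields_from_tensor(F):
    """Read E_i = F_{i0} and B = (F_23, F_31, F_12) back out of the tensor."""
    E = (F[1][0], F[2][0], F[3][0])
    B = (F[2][3], F[3][1], F[1][2])
    return E, B


def boost_closed_form(E, B, v):
    """Longitudinal components unchanged; transverse ones mix with a factor gamma."""
    g = 1.0 / math.sqrt(1.0 - v * v)
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    E_new = (Ex, g * (Ey - v * Bz), g * (Ez + v * By))   # E_T + v x B_T
    B_new = (Bx, g * (By + v * Ez), g * (Bz - v * Ey))   # B_T - v x E_T
    return E_new, B_new


E, B, v = (1.0, 2.0, 3.0), (0.5, -1.0, 2.0), 0.6
E_t, B_t = fields_from_tensor(transform_tensor(field_tensor(E, B), boost_jacobian(v)))
E_c, B_c = boost_closed_form(E, B, v)
max_diff = max(abs(a - b) for a, b in zip(E_t + B_t, E_c + B_c))
print("largest difference between the two routes:", max_diff)
```

Both routes agree to rounding error, which is a useful guard against sign slips when the antisymmetric tensor is written out by hand.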

We observe that we can construct two invariants from the $F_{\mu\nu}$ field.
The first is

$$I_1 = \tfrac{1}{2}\,F_{\mu\nu}F^{\nu\mu} = E^2 - B^2. \tag{6.4.39}$$

The second is, in fact, a pseudoscalar:

$$I_2 = \tfrac{1}{4}\,F_{\mu\nu}\tilde{F}^{\mu\nu} = \mathbf{E}\cdot\mathbf{B}. \tag{6.4.40}$$

One verifies easily that $I_1$ and $I_2$ are left invariant by the above
transformation rules for $\mathbf{E}$ and $\mathbf{B}$.
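
The invariance of $E^2 - B^2$ and $\mathbf{E}\cdot\mathbf{B}$ can be spot-checked numerically. The sketch below assumes a boost along $x$ with the transverse rules $\mathbf{E}'_T = \gamma(\mathbf{E}_T + \mathbf{v}\times\mathbf{B}_T)$ and $\mathbf{B}'_T = \gamma(\mathbf{B}_T - \mathbf{v}\times\mathbf{E}_T)$; the helper names and field values are illustrative only.

```python
import math


def boost_fields(E, B, v):
    """Fields seen by an observer moving with velocity v along x (units with c = 1)."""
    g = 1.0 / math.sqrt(1.0 - v * v)
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    return ((Ex, g * (Ey - v * Bz), g * (Ez + v * By)),
            (Bx, g * (By + v * Ez), g * (Bz - v * Ey)))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def invariants(E, B):
    """Return (E^2 - B^2, E . B)."""
    return dot(E, E) - dot(B, B), dot(E, B)


E, B = (1.0, 2.0, 3.0), (0.5, -1.0, 2.0)
I1, I2 = invariants(E, B)
for v in (0.1, 0.5, 0.9):
    E2, B2 = boost_fields(E, B, v)
    J1, J2 = invariants(E2, B2)
    print(f"v = {v}: change in I1 = {J1 - I1:.2e}, change in I2 = {J2 - I2:.2e}")
```

The individual field components change substantially at large $v$, while the two combinations stay fixed up to rounding.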

Another invariant is the phase of $\mathbf{E}$ or $\mathbf{B}$; thus, for a monochromatic
wave,

$$k_\mu x^\mu = k\cdot x = k'\cdot x' = k'_\mu x'^\mu \tag{6.4.41}$$

defines $k^\mu$ to transform like $x^\mu$. For a wave vector in the $\mathbf{v}$ direction,

$$\omega' = \frac{\omega - vk}{\sqrt{1-v^2}}$$

and since $\omega = k$,

$$\omega' = \omega\sqrt{\frac{1-v}{1+v}}. \tag{6.4.42}$$

This is the relativistic Doppler shift. For other angles, the formula is more
complicated but straightforward, although some care must be taken to
define the observation when $\theta$ is near 90°. (See Problem 6.9.)
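
A short numerical illustration of the phase argument, under the assumption that $k^\mu = (\omega, \mathbf{k})$ transforms exactly like $x^\mu$ under the boost inverse to (6.4.34): the sketch checks that $k_\mu x^\mu$ is unchanged and that a light wave travelling along the boost direction is red-shifted by $\sqrt{(1-v)/(1+v)}$. The event coordinates and frequency are arbitrary illustrative values.

```python
import math


def boost_four_vector(a, v):
    """Components (a^0, a^x, a^y, a^z) for an observer moving with velocity v along x (c = 1)."""
    g = 1.0 / math.sqrt(1.0 - v * v)
    a0, ax, ay, az = a
    return (g * (a0 - v * ax), g * (ax - v * a0), ay, az)


def minkowski_dot(a, b):
    """a_mu b^mu with signature (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]


omega, v = 2.0, 0.6
k = (omega, omega, 0.0, 0.0)     # light wave along +x, so |k| = omega
x = (1.3, 0.4, -0.7, 2.1)        # an arbitrary event

k_prime = boost_four_vector(k, v)
x_prime = boost_four_vector(x, v)

phase_change = minkowski_dot(k_prime, x_prime) - minkowski_dot(k, x)
doppler = omega * math.sqrt((1.0 - v) / (1.0 + v))
print("omega' from the boost:", k_prime[0])
print("omega' from the Doppler formula:", doppler)
print("change in phase:", phase_change)
```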


6.5. EQUATIONS OF MOTION FOR A POINT CHARGE 
IN AN ELECTROMAGNETIC FIELD 


We can guess various covariant equations of motion for a charged point
particle. We narrow the list of suspects by requiring that they (the
equations) be linear in the $F_{\mu\nu}$ field and contain no derivatives.⁹

We start from the four-velocity $u^\mu = dx^\mu/d\tau$. A four-vector acceleration
is $du^\mu/d\tau$; a covariant equation would have a four-vector on the right-hand side,
linear in $F_{\mu\nu}$. The only possibility is $F_{\mu\nu}u^\nu$. We conjecture¹⁰

$$m\frac{du_\mu}{d\tau} = \kappa qF_{\mu\nu}u^\nu \tag{6.5.1}$$

where $q$ is the charge and the constant $\kappa$ will be adjusted to agree with
the nonrelativistic limit. Consider first $\mu = i$:

$$-m\frac{du^i}{d\tau} = \kappa q\left(F_{ij}u^j + F_{i0}u^0\right) \tag{6.5.2}$$

or

$$-\frac{1}{\sqrt{1-v^2}}\,\frac{dp_i}{dt} = \frac{\kappa q}{\sqrt{1-v^2}}\,\left(E_i + (\mathbf{v}\times\mathbf{B})_i\right)$$

so that $\kappa = -1$. The particle equation is

$$\frac{d\mathbf{p}}{dt} = q(\mathbf{E} + \mathbf{v}\times\mathbf{B}) \tag{6.5.3}$$

where $\mathbf{p}$ is the particle momentum:

$$\mathbf{p} = \frac{m\mathbf{v}}{\sqrt{1-v^2}}.$$

⁹The presence of first derivatives would, for example, result from an internal structure
of the particle, such as a magnetic moment.

The fourth component of (6.5.1) is not independent, since

$$u^\mu\frac{du_\mu}{d\tau} = \frac{\kappa q}{m}\,u^\mu F_{\mu\nu}u^\nu = 0,$$

or

$$\frac{d}{d\tau}\left(u^\mu u_\mu\right) = 0.$$

This is as it should be. The fourth equation is

$$m\frac{du_0}{d\tau} = -qF_{0i}u^i \quad\text{or}\quad m\frac{d}{dt}\,\frac{1}{\sqrt{1-v^2}} = q\mathbf{E}\cdot\mathbf{v}. \tag{6.5.4}$$

¹⁰Observe that covariance requires four equations for four variables, the $u_\mu$. The
equations, however, cannot be independent, since $u_\mu u^\mu = 1$. The existence of such
constraints occurs often in relativistic theories.

Equation (6.5.4) looks like an energy equation; the right-hand side reminds
us of work done on the charge, the left-hand side the increase in
energy $m/\sqrt{1-v^2}$. It is, in fact, an energy conservation law, when $\mathbf{E}$ is a
static field derivable from a potential, $\mathbf{E} = -\nabla\phi$. It is easily seen that
(6.5.4) under these circumstances implies

$$\frac{d}{dt}\left(\frac{m}{\sqrt{1-v^2}} + q\phi\right) = 0. \tag{6.5.5}$$
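
The conservation law can be illustrated numerically. The sketch below is a minimal example under stated assumptions: one space dimension, units with $c = 1$, a hypothetical static potential $\phi(x) = \tfrac{1}{2}Kx^2$ (so $E = -Kx$), and a standard fourth-order Runge-Kutta integrator for $dp/dt = qE$, $dx/dt = p/\sqrt{m^2 + p^2}$. None of these choices come from the text; they only serve to show $m/\sqrt{1-v^2} + q\phi$ staying constant along the motion.

```python
import math

M, Q, K = 1.0, 1.0, 1.0   # mass, charge, potential strength (illustrative values)


def phi(x):
    """Hypothetical static potential."""
    return 0.5 * K * x * x


def e_field(x):
    """E = -d(phi)/dx."""
    return -K * x


def derivs(x, p):
    """dx/dt = v = p / sqrt(m^2 + p^2),  dp/dt = q E(x)."""
    return p / math.sqrt(M * M + p * p), Q * e_field(x)


def rk4_step(x, p, dt):
    k1x, k1p = derivs(x, p)
    k2x, k2p = derivs(x + 0.5 * dt * k1x, p + 0.5 * dt * k1p)
    k3x, k3p = derivs(x + 0.5 * dt * k2x, p + 0.5 * dt * k2p)
    k4x, k4p = derivs(x + dt * k3x, p + dt * k3p)
    return (x + dt * (k1x + 2.0 * k2x + 2.0 * k3x + k4x) / 6.0,
            p + dt * (k1p + 2.0 * k2p + 2.0 * k3p + k4p) / 6.0)


def energy(x, p):
    """m / sqrt(1 - v^2) + q phi, using m / sqrt(1 - v^2) = sqrt(m^2 + p^2)."""
    return math.sqrt(M * M + p * p) + Q * phi(x)


x, p = 1.0, 0.0
w_start = energy(x, p)
for _ in range(10000):
    x, p = rk4_step(x, p, 1.0e-3)
w_end = energy(x, p)
print("energy at start:", w_start)
print("energy at end:  ", w_end)
```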


6.6. RELATIVISTIC CONSERVATION LAWS 


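We prove a theorem. Let $Q^\mu$ be a conserved vector field, that is, let

$$\partial_\mu Q^\mu = 0. \tag{6.6.1}$$

Then

$$\frac{d}{dx^0}\int_V Q^0\,d^3r = -\int_S \mathbf{Q}\cdot d\mathbf{S} \tag{6.6.2}$$

and there is a globally conserved quantity

$$Q = \int_V d^3r\,Q^0$$

provided there is no outgoing flux:

$$\int_S \mathbf{Q}\cdot d\mathbf{S} = 0.$$

The theorem is that $Q$ is an invariant.¹¹ We already know an example:
the charge of a particle.

Proof: Consider

$$Q(t=0) = \int d^4x\,j^\mu(x)\,\partial_\mu\theta(n\cdot x) \tag{6.6.3}$$

with

$$\theta(t) = \begin{cases} 1, & t > 0 \\ 0, & t < 0, \end{cases} \tag{6.6.4}$$

$n\cdot x = n_\mu x^\mu$, and $n_\mu$ a unit vector in the time direction, so that $n\cdot x = x^0$.
Then

$$Q = Q(0) = \int d^4x\,j^0(x)\,\frac{\partial}{\partial x^0}\,\theta(x^0) = \int d^4x\,j^0(x)\,\delta(x^0) \tag{6.6.5}$$

and

$$Q = \int d^3x\,j^0(\mathbf{x}, 0).$$

Consider a different observer $O'$. He calculates $Q'$:

$$Q' = \int d^4x'\,j'^\mu(x')\,\partial'_\mu\theta(n\cdot x') = \int d^4x\,\frac{\partial x'^\mu}{\partial x^\lambda}\,j^\lambda(x)\,\frac{\partial x^\sigma}{\partial x'^\mu}\,\partial_\sigma\theta(n\cdot x')$$

$$= \int d^4x\,j^\lambda(x)\,\partial_\lambda\theta(n\cdot x').$$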

But


and


¹¹We consider a vector $Q^\mu$ for simplicity. The theorem also holds for a conserved
tensor $Q^{\mu\nu}$, with $\partial_\mu Q^{\mu\nu} = 0$. Then $Q^\nu = \int_V d^3r\,Q^{0\nu}$ will be globally conserved and a
four-vector.

$$Q - Q' = \int d^4x\,j^\mu\,\partial_\mu\left[\theta(n\cdot x) - \theta(n'\cdot x)\right].$$

Since $\partial_\mu j^\mu = 0$, $\partial_\mu$ can be placed to the left of $j^\mu$ in the integral:

$$Q - Q' = \int d^4x\,\partial_\mu\left[j^\mu\left(\theta(n\cdot x) - \theta(n'\cdot x)\right)\right]. \tag{6.6.6}$$

Integrating each derivative, we see that the surface terms all vanish, since
for large $\mathbf{x}$, $j^\mu$ vanishes and for large $x_0$, both $n\cdot x$ and $n'\cdot x$ are positive
or negative, so that the $\theta$ functions cancel.

Are there other conserved integrals of functions of the field? Evidently,
yes: we already know about energy, momentum, and angular
momentum. Consider first energy and momentum. We have seen that in
particle mechanics they together form a conserved four-vector; we also
know that in electromagnetic field theory, they both can be expressed as
integrals of quadratic functions of the fields. It is natural to suppose,
therefore, that we can write

$$P_F^\mu = \int d^3x\,T_F^{0\mu} \tag{6.6.7}$$

where $P_F^0$ is the energy, $\mathbf{P}_F$ the momentum of the field, and $T_F^{\mu\nu}$ a
conserved tensor, that is,

$$\partial_\mu T_F^{\mu\nu} = 0 \tag{6.6.8}$$

in the absence of sources.

This is, in fact, the case; $T_F^{\mu\nu}$ is called the electromagnetic stress-energy
(sometimes energy-momentum) tensor. From now on, we shall
call it the stress tensor, meaning the four-dimensional stress-energy-momentum
tensor, unless otherwise stated.

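From the electromagnetic conservation laws that we have already
proved, we can identify all the components of $T_F^{\mu\nu}$. From the energy
equation, $P_F^0 = \int d^3x\,T_F^{00}$, we see that $T_F^{00}$ is the energy density and

$$\frac{dP_F^0}{dt} = -\int d^3x\,\partial_iT_F^{i0};$$

thus, $T_F^{i0}$ is the Poynting vector:

$$T_F^{i0} = \frac{1}{4\pi}\,(\mathbf{E}\times\mathbf{B})_i. \tag{6.6.9}$$

We also know the momentum density

$$p_F^i = \frac{1}{4\pi}\,(\mathbf{E}\times\mathbf{B})_i.$$

But, from (6.6.7),

$$p_F^i = T_F^{0i}, \tag{6.6.10}$$

so

$$T_F^{i0} = T_F^{0i}.$$

From the momentum conservation equation, we find

$$\frac{dP_F^j}{dt} = \int d^3x\,\partial_0T_F^{0j} = -\int d^3x\,\partial_iT_F^{ij}$$

and $-T_F^{ij}$ is the Maxwell stress tensor:

$$-T_F^{ij} = \frac{1}{4\pi}\left(E_iE_j - \tfrac{1}{2}E^2\delta_{ij} + B_iB_j - \tfrac{1}{2}B^2\delta_{ij}\right). \tag{6.6.11}$$

In all, we have a symmetric two-index object $T_F^{\mu\nu}$ that generates a vector
on integrating $T_F^{0\nu}$ over $d^3x$. It is, in fact, a Lorentz tensor. Its manifestly
covariant expression is

$$T_F^{\mu\nu} = \frac{1}{4\pi}\left(F^{\mu\alpha}F_\alpha{}^\nu + \tfrac{1}{4}\,\eta^{\mu\nu}F_{\alpha\beta}F^{\alpha\beta}\right). \tag{6.6.12}$$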


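Next, we calculate $\partial_\mu T_F^{\mu\nu}$:

$$\partial_\mu T_F^{\mu\nu} = \frac{1}{4\pi}\,F_\alpha{}^\nu\,\partial_\mu F^{\mu\alpha} + Q^\nu \tag{6.6.13}$$

where

$$4\pi Q^\nu = F^{\mu\alpha}\,\partial_\mu F_\alpha{}^\nu + \tfrac{1}{2}\,F^{\alpha\beta}\,\partial^\nu F_{\alpha\beta} \tag{6.6.14}$$

$$= \tfrac{1}{2}\,F^{\mu\alpha}\left(\partial_\mu F_\alpha{}^\nu - \partial_\alpha F_\mu{}^\nu + \partial^\nu F_{\mu\alpha}\right)$$

$$= \tfrac{1}{2}\,F^{\mu\alpha}\left(\partial_\mu F_\alpha{}^\nu + \partial_\alpha F^\nu{}_\mu + \partial^\nu F_{\mu\alpha}\right) = 0. \tag{6.6.15}$$

Thus,

$$\partial_\mu T_F^{\mu\nu} = F^{\nu\alpha}j_\alpha \tag{6.6.16}$$

and $T_F^{\mu\nu}$ is conserved in the absence of sources. We know, however, that
global energy-momentum conservation holds in the presence of sources,
if we add together the energies and momenta of the sources (particles)
and fields. We now write the most obvious candidate for a particle stress
tensor, $T_p^{\mu\nu}$, that we will add to $T_F^{\mu\nu}$:

$$T_p^{\mu\nu} = m\,u^\mu u^\nu d \tag{6.6.17}$$

where $d$ is the scalar density defined in (6.4.11):

$$d = \sqrt{1-v^2}\,\delta^3(\mathbf{x} - \mathbf{x}_p(x^0)). \tag{6.6.18}$$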


If more than one particle is present, or there are other interacting
fields in play, then the single term in (6.6.17) must be replaced by a sum.
We here confine ourselves to this simple system of electromagnetic field
plus charged point particles. We shall see in Section 7.4 that for the
relativistically invariant theories that we are discussing, one can always
find an appropriate conserved stress tensor. However, the electrodynamic
plus point particles case is particularly simple, in that

$$T^{\mu\nu} = T_F^{\mu\nu} + T_p^{\mu\nu} \tag{6.6.19}$$

is, in fact, the correct conserved stress tensor.
We calculate $\partial_\mu T_p^{\mu\nu}$, remembering that

$$\partial_\mu(u^\mu d) = \partial_\mu J^\mu = 0. \tag{6.6.20}$$

Therefore, since $u^\nu$ depends explicitly only on time, not $\mathbf{x}$,

$$\partial_\mu T_p^{\mu\nu} = m\,u^\mu d\,\frac{\partial u^\nu}{\partial x^\mu} = m\,\frac{du^\nu}{d\tau}\,d. \tag{6.6.21}$$

But, from (6.5.1), with $\kappa = -1$,


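$$m\frac{du^\nu}{d\tau} = -qF^{\nu\mu}u_\mu$$

and

$$\partial_\mu T_p^{\mu\nu} = -qF^{\nu\mu}u_\mu d = -F^{\nu\mu}j_\mu \tag{6.6.22}$$

where $j_\mu$ is the current density four-vector. Adding $T^{\mu\nu} = T_F^{\mu\nu} + T_p^{\mu\nu}$,
from (6.6.16) and (6.6.22), we get

$$\partial_\mu T^{\mu\nu} = F^{\nu\alpha}j_\alpha - F^{\nu\mu}j_\mu = 0.$$

Thus, we have a symmetric, locally conserved tensor $T^{\mu\nu}$, from which we
can construct a globally conserved energy-momentum four-vector:

$$P^\mu = \int d^3x\,T^{0\mu}. \tag{6.6.23}$$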


In more general field theories, it is easy (see Section 7.3) to construct
a stress tensor $\Theta^{\mu\nu}$ that is conserved,

$$\partial_\mu\Theta^{\mu\nu} = 0,$$

but is not necessarily symmetrical. It is, however, possible (but not
necessarily easy) to find a conserved symmetrical $T^{\mu\nu}$. These techniques
will be discussed in Sections 7.4 and 7.8.

The importance of the symmetry is that it makes possible the construction
of six more global constants by defining a third-rank tensor

$$M^{\mu\alpha\beta} = \left(x^\alpha T^{\mu\beta} - x^\beta T^{\mu\alpha}\right). \tag{6.6.24}$$

$M^{\mu\alpha\beta}$ is conserved with respect to the $\mu$ index:

$$\partial_\mu M^{\mu\alpha\beta} = \delta^\alpha_\mu T^{\mu\beta} - \delta^\beta_\mu T^{\mu\alpha} = T^{\alpha\beta} - T^{\beta\alpha} = 0$$

provided $T^{\alpha\beta} = T^{\beta\alpha}$.
Therefore, we have six global constants constructed as usual as

$$L^{\alpha\beta} = \int d^3x\,M^{0\alpha\beta} = \int d^3x\left(x^\alpha T^{0\beta} - x^\beta T^{0\alpha}\right). \tag{6.6.25}$$

The space-space components $L_{ij}$ define an angular momentum

$$\mathbf{L} = \int d^3x\,\mathbf{r}\times\mathbf{p}, \tag{6.6.26}$$

where $p^i = T^{0i}$ is the momentum density. The components $L^{0i}$ are

$$L^{0i} = \int d^3x\left(x^0T^{0i} - x^iT^{00}\right) = P^ix^0 - P^0\bar{x}^i \tag{6.6.27}$$

where $P^0$ is the total energy:

$$P^0 = \int d^3x\,T^{00}, \tag{6.6.28}$$

$\bar{x}^i$ is the center of energy,

$$\bar{x}^i = \frac{\int d^3x\,T^{00}x^i}{\int d^3x\,T^{00}}, \tag{6.6.29}$$

and $P^i$ and $x^0$ are, as usual, total momentum and time. We thus learn
that for a relativistic system, $dL^{0i}/dx^0 = 0$, or

$$\frac{d\bar{x}^i}{dx^0} = \frac{P^i}{P^0}. \tag{6.6.30}$$

This is as close to a center of mass theorem as one can come in a
relativistic theory: The center of energy moves with constant velocity,
$\bar{\mathbf{v}} = \mathbf{P}/W$. None of the other center-of-mass theorems of nonrelativistic
mechanics hold.
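
A minimal illustration of the center-of-energy statement, under a simplifying assumption that is not in the text: a set of free, non-interacting point particles, for which $T^{00}$ is a sum of terms $E_a\,\delta^3(\mathbf{x} - \mathbf{x}_a)$ and each particle moves with velocity $\mathbf{p}_a/E_a$. The masses, momenta, and positions below are arbitrary; one particle is taken massless to stress that energy, not mass, is the weight.

```python
import math


def energy(m, p):
    """E = sqrt(m^2 + p^2) in units with c = 1."""
    return math.sqrt(m * m + sum(c * c for c in p))


# (mass, momentum, initial position): illustrative values only
particles = [
    (1.0, (0.5, 0.0, 0.2), (0.0, 0.0, 0.0)),
    (2.0, (-1.0, 3.0, 0.0), (1.0, -1.0, 0.5)),
    (0.0, (0.0, 0.0, 2.0), (-2.0, 0.3, 0.0)),   # massless particle
]


def center_of_energy(t):
    """Energy-weighted mean position at time t for free particles."""
    total = 0.0
    weighted = [0.0, 0.0, 0.0]
    for m, p, x0 in particles:
        e = energy(m, p)
        total += e
        for i in range(3):
            weighted[i] += e * (x0[i] + p[i] / e * t)   # free motion with v = p / E
    return [w / total for w in weighted]


total_energy = sum(energy(m, p) for m, p, _ in particles)
total_momentum = [sum(p[i] for _, p, _ in particles) for i in range(3)]

x_a, x_b = center_of_energy(0.0), center_of_energy(2.0)
velocity = [(b - a) / 2.0 for a, b in zip(x_a, x_b)]
expected = [pi / total_energy for pi in total_momentum]
print("velocity of the center of energy:", velocity)
print("P / W:                           ", expected)
```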


CHAPTER 6 PROBLEMS 


6.1 A point particle at rest undergoes an acceleration $\mathbf{a} = \hat{\mathbf{i}}a_x + \hat{\mathbf{j}}a_y$, so that
its motion is described by the equations $x = \tfrac{1}{2}a_xt^2$, $y = \tfrac{1}{2}a_yt^2$.
How does an observer $O'$ moving to the left along the $x$-axis
with velocity $v$ describe the motion near $x = y = t = 0$? That is, how
does acceleration transform under a Lorentz transformation?


6.2 Using the result of Problem 6.1, that is, $a = a_0(1 - v^2)^{3/2}$, show that
a particle starting from rest at $t = 0$, and undergoing an acceleration
$a(\tau)$ in its own rest system as a function of its proper time $\tau$, travels
a distance $x = \int_0^\tau \sinh u(\tau')\,d\tau'$ in proper time $\tau$. Here, $u(\tau)$ is the
"proper velocity":

$$u(\tau) = \int_0^\tau a(\tau')\,d\tau'.$$


6.3 Show that the electromagnetic energy radiated per unit time by a
point particle is an invariant under a Lorentz transformation; from
that, calculate the rate of energy radiation by a point particle moving
with velocity $\mathbf{v}$ and acceleration $\mathbf{a}$. Then calculate the instantaneous
rate of radiation of momentum for a particle moving with velocity $\mathbf{v}$
and acceleration $\mathbf{a}$. (Hint: First show that the momentum radiated
in the rest system is zero.)


6.4 From the known electrostatic field (or potential), find by Lorentz
transformation the electric and magnetic fields of a point charge
moving with constant velocity $\mathbf{v}$ with $|\mathbf{v}| < c$. These fields must, of
course, be expressed in terms of the space and time coordinates used
by the observer with respect to whom the charge has the velocity $\mathbf{v}$.


6.5 The production of a $\pi^+$ meson from a proton by a photon is
described as the process

$$\gamma + p \to \pi^+ + n.$$

At what photon energy $E_\gamma$ on a proton at rest will the threshold for
the process be reached? Call the respective masses $M_\pi$, $M_p$, $M_n$, and
$m_\gamma = 0$.


6.6 Consider a Lorentz transformation in an arbitrary direction $\mathbf{v}$. Write
a three-vector equation for the transformation $\mathbf{r}' = \mathbf{f}(\mathbf{r}, \mathbf{v}, t)$ and a
three-scalar equation $t' = g(\mathbf{r}, \mathbf{v}, t)$, where $\mathbf{f}$ is a vector function of $\mathbf{r}$,
$\mathbf{v}$, and $t$, and $g$ a scalar function of $\mathbf{r}$, $\mathbf{v}$, and $t$.


6.7 An observer $O'$ has a very small velocity $\mathbf{v}_1$ relative to $O$; a second
observer $O''$ has a very small velocity $\mathbf{v}_2$ relative to $O'$.

Now consider observers $\bar{O}'$ and $\bar{O}''$ who have interchanged the
order, that is, $\bar{O}'$ has $\mathbf{v}_2$, $\bar{O}''$ has $\mathbf{v}_1$. To lowest order in $v_1v_2$, what
is the transformation that brings the observers $O''$ and $\bar{O}''$ into
coincidence?


6.8 A charged condenser moves with constant velocity $\mathbf{v}$. Call the charge
density $\rho(\mathbf{r})$.

(a) In terms of $\rho$, write expressions for the electric and magnetic
fields generated by $\rho$.

(b) Write an expression for the force per unit volume on the moving
charge density.

(c) Integrate the expression you found in (b) to find the self-force
on the condenser. It should be zero.

(d) From your results in (b), write an expression for the torque per
unit volume on the moving charge density and integrate it to
find the self-torque. The answer should involve the tensor


{= ears 2) 5(2)p(1) d°x2 dn. 
2 1 


Let the condenser consist of two oppositely charged very small 
spheres, separated by a vector d. In terms of the total charge Q 
on each sphere and the vector d, write an expression for the self- 
torque. Note that it is not zero, and so should have (according 
to prerelativistic theory) furnished a way of detecting absolute 
motion. Experiments, of course, failed to do so. Can you say 
why? After all, the field calculations are correct, as is the Lorentz 
force. 


6.9 Study the relativistic Doppler shifts by noting that the phase of an
electromagnetic wave is invariant. Thus,

$$e^{ik'_\mu x'^\mu} = e^{ik_\mu x^\mu}$$

and $k^\mu$ must transform like $x^\mu$. Thus, for a Lorentz transformation
in the $x$ direction

$$k'_x = \frac{k_x - v\omega}{\sqrt{1-v^2}}, \qquad k'_y = k_y, \qquad k'_z = k_z, \qquad \omega' = \frac{\omega - vk_x}{\sqrt{1-v^2}}$$

with $\mathbf{k}$ the wave vector and $\omega$ the frequency in the rest system of the
radiator. Show that the frequency measured by an observer who sees
the radiator moving with velocity $v$ is

$$\omega' = \frac{\omega\sqrt{1-v^2}}{1 + v\cos\theta}$$

where $\theta$ is the angle the radiation makes with the motion of the
radiator (as seen by the observer).


6.10 (a) Calculate the ratio $r$ of $L$, the rate of energy loss through radiation,
to $G$, the rate of gain of energy from the accelerating field
in an electron linear accelerator. Express $r = L/G$ in terms of the
accelerating field $E$ and natural constants. Assume the electron is
relativistic so that $v \approx c$.
(b) Calculate the ratio $r$ of the energy loss per cycle of a relativistic
electron in a circular accelerator to its energy. The electron has
energy $E$, the average magnetic field is $B$, and the machine
radius is $R$. Assume $v \approx c$ so that the frequency $\omega \approx c/R$. Eliminate
$R$ and so express the ratio in terms of $E$, $B$, and natural
constants. Finally, express the ratio $r$ in terms of $R$ in kilometers,
$B$ in tesla, and $E$ in electron rest energy units. Explain from
these results the advantage of linear accelerators over circular
accelerators for ultrarelativistic particles.


6.11 The functioning of an accelerator requires that the chosen orbit be
stable with respect to small deviations. Consider a particle beam in
the $z$ direction and a stabilizing electrostatic potential $\phi = kx^2/2$ in
the $x$ direction. Unfortunately, since $\nabla^2\phi = 0$, the $y$ potential must
be destabilizing. The overall potential must be a quadrupole: $\phi = k[(x^2/2) - (y^2/2)]$.
The alternating gradient principle takes advantage
of the fact that a stabilizing passage followed by a destabilizing
one can be stabilizing with a proper choice of parameters. Since the
potential is harmonic, one can calculate the matrix $U_{21}$ that takes
the vector $(x_1, \dot{x}_1/\omega)$ to $(x_2, \dot{x}_2/\omega)$ and combine the $U$ matrices for
different lengths of potential, $a$ and $b$:

$$U_{31} = U_{32}U_{21}, \text{ etc.}$$

Let $U_{21}$ describe a converging sector, with $\omega(t_2 - t_1) = \phi$, and $U_{32}$
a diverging sector with $\omega(t_3 - t_2) = \theta$. Here, $\omega^2 = (k/m)(1 - v_z^2)^{1/2}$,
provided $\dot{x}$ and $\dot{y}$ are much smaller than $v_z$, the beam velocity.

(a) Calculate $U_{21}(\phi) = U_{43}(\phi)$ and $U_{32}(\theta)$.

(b) Combine the sector $U$'s as described above to give
$U = U_{43}(\phi)\,U_{32}(\theta)\,U_{21}(\phi)$.

(c) Show that $\det U = 1$ and its eigenvalues are $e^{\pm i\mu}$, where
$\cos\mu = \cosh\theta\cos 2\phi$ and $|\cosh\theta\cos 2\phi| < 1$. For $|\cosh\theta\cos 2\phi| > 1$,
the eigenvalues are $e^{\pm\lambda}$, where $\cosh\lambda = |\cosh\theta\cos 2\phi|$ and
$\cos 2\phi > 0$, or $-e^{\pm\lambda}$ when $\cos 2\phi < 0$.

Note that U is neither unitary nor Hermitian, so the apparent 

stable behavior coming from the eigenvalues can mask actual 

growth. To illustrate this phenomenon, find the two components 

of the vector U (b) and show that one of them can actually be 

arbitrarily large, even for $|\cosh\theta\cos 2\phi| < 1$. Then show, however,
that for small $2\phi = \delta$, the sector is stabilizing.

Show that magnetic stabilizing via a quadrupole magnetic field
leads to the same equations for $x$ and $y$.


CHAPTER 7


Lagrangian Field Theory


In classical field theory, we introduce Lagrange equations and Hamilton's
principle for several reasons. These include the special simplicity of using
generalized coordinates and eliminating constraints, but most important,
the ease of building invariance principles and the corresponding conservation
laws into our equations. In quantum theory, the canonical formalism
plays such a crucial role that the use of Lagrangians is often indispensable.


7.1. REVIEW OF LAGRANGIANS IN MECHANICS 


The Lagrangian is a function of generalized coordinates and velocities, $q_a$
and $\dot{q}_a$:

$$L = L(q_a, \dot{q}_a, t), \tag{7.1.1}$$

where $\dot{q}_a$ is the derivative with respect to $t$. Here, $t$ can be a variable,
usually the time, that is used to parametrize the trajectories of the system.

The second-order equations of motion follow from Hamilton's principle,
that is, from the requirement that the action

$$S(q_a) = \int_{t_1}^{t_2} L(q_a(t), \dot{q}_a(t), t)\,dt \tag{7.1.2}$$

be stationary for infinitesimal variations of $q_a(t) \to q_a(t) + \delta q_a$, or

$$q_a(t) \to q_a(t) + \epsilon\eta_a(t) \tag{7.1.3}$$

provided $\eta_a(t_1) = \eta_a(t_2) = 0$. Here, $\epsilon$ is an infinitesimal and $\eta_a(t)$ an
arbitrary function of $t$. Sometimes, we will use the common shorthand
$\epsilon\eta_a(t) = \delta q_a$; however, in case of confusion, it is often best to go back
to (7.1.3). For example, (7.1.3) makes it obvious that $(d/dt)\,\delta q_a = \delta(dq_a/dt)$.
The notation $S(q_a)$ indicates that $S$ is a functional of $q_a(t)$; that
is, it is a number that depends on the functions $q_a(t)$.

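We carry out the variation

$$\delta S = S(q_a + \delta q_a) - S(q_a)$$

$$= \int_{t_1}^{t_2} dt\,\sum_a\left(\frac{\partial L}{\partial q_a}\,\delta q_a + \frac{\partial L}{\partial\dot{q}_a}\,\delta\dot{q}_a\right)$$

$$= \int_{t_1}^{t_2} dt\,\sum_a\left(\frac{\partial L}{\partial q_a} - \frac{d}{dt}\frac{\partial L}{\partial\dot{q}_a}\right)\delta q_a + \sum_a\left.\frac{\partial L}{\partial\dot{q}_a}\,\delta q_a\right|_{t_1}^{t_2}. \tag{7.1.4}$$

Since the integrated term vanishes at the $t$ boundaries, (7.1.4) shows that
the condition $\delta S = 0$ for any $\delta q_a$ requires

$$\frac{\partial L}{\partial q_a} - \frac{d}{dt}\frac{\partial L}{\partial\dot{q}_a} = 0. \tag{7.1.5}$$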
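
The stationarity of the action can be seen numerically. The sketch below is a minimal example under assumptions that are not from the text: a harmonic oscillator with $L = \tfrac{1}{2}\dot{q}^2 - \tfrac{1}{2}q^2$, a midpoint-rule discretization of $S$, and the variation $\eta(t) = \sin(\pi t/T)$, which vanishes at both end points. For the true solution $q = \sin t$ the first-order change in $S$ is zero up to discretization error, while for a straight line with the same end points it is not.

```python
import math

N = 2000          # number of time steps (illustrative)
T = 1.0           # total time
H = T / N


def action(path):
    """Midpoint-rule approximation to S = integral of (q'^2 / 2 - q^2 / 2) dt."""
    s = 0.0
    for i in range(N):
        qdot = (path[i + 1] - path[i]) / H
        qmid = 0.5 * (path[i + 1] + path[i])
        s += (0.5 * qdot * qdot - 0.5 * qmid * qmid) * H
    return s


def first_variation(q, eta, eps=1.0e-3):
    """Central difference in eps of S(q + eps * eta), i.e. dS/d(eps) at eps = 0."""
    plus = [a + eps * b for a, b in zip(q, eta)]
    minus = [a - eps * b for a, b in zip(q, eta)]
    return (action(plus) - action(minus)) / (2.0 * eps)


times = [i * H for i in range(N + 1)]
eta = [math.sin(math.pi * t / T) for t in times]       # vanishes at t = 0 and t = T
solution = [math.sin(t) for t in times]                # satisfies q'' = -q
straight = [math.sin(T) * t / T for t in times]        # same end points, not a solution

dS_solution = first_variation(solution, eta)
dS_straight = first_variation(straight, eta)
print("first variation about the true path:    ", dS_solution)
print("first variation about the straight line:", dS_straight)
```

For the straight line the continuum value of the first variation is $-\sin(1)/\pi$, which the discretized result reproduces closely.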


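For obvious reasons, the left-hand side of (7.1.5) is called the
variational derivative of $S$, written as $\delta S/\delta q_a$:

$$\frac{\delta S}{\delta q_a} = \frac{\partial L}{\partial q_a} - \frac{d}{dt}\frac{\partial L}{\partial\dot{q}_a} + \frac{d^2}{dt^2}\frac{\partial L}{\partial\ddot{q}_a} \tag{7.1.6}$$

and so on, if the Lagrangian contains higher derivatives.¹

The relation between the symmetries of the Lagrangian and conservation
laws is given in first instance by Noether's theorem, which follows.
Suppose that when $q_a \to q_a + \delta q_a$, $\delta L = d\,\delta\Lambda/dt$, where $d\,\delta\Lambda/dt$ is a
time derivative of a function of the $q$'s and $\dot{q}$'s. Then, independent of the
equation of motion,

$$\frac{d\,\delta\Lambda}{dt} - \sum_a\left[\frac{\partial L}{\partial q_a}\,\delta q_a + \frac{\partial L}{\partial\dot{q}_a}\,\delta\dot{q}_a\right] = 0 \tag{7.1.7}$$

or, using the equations of motion, we get

$$\frac{d}{dt}\left(\sum_a\frac{\partial L}{\partial\dot{q}_a}\,\delta q_a - \delta\Lambda\right) = 0 \tag{7.1.8}$$

and we have a conserved quantity

$$\delta Q = \sum_a\frac{\partial L}{\partial\dot{q}_a}\,\delta q_a - \delta\Lambda \tag{7.1.9}$$

which is independent of $t$.

¹If the Lagrangian does have higher derivatives, the action principle most economically
formulated states that $\delta S$ depends only on the variation of $q$, $\dot{q}$, etc. on the boundary.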
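We list the major examples from classical mechanics:

1. $\delta\mathbf{x}_a = \boldsymbol{\delta}$, a fixed displacement, identical for every particle, with $L$
invariant. This would hold for motion in a potential depending
only on the relative position of particles. In that case, we would
have

$$\delta Q = \sum_a\mathbf{p}_a\cdot\boldsymbol{\delta} = \sum_a\nabla_{\mathbf{v}_a}L\cdot\boldsymbol{\delta} \tag{7.1.10}$$

and we have total momentum conserved by virtue of translational
invariance. The symbol $\nabla_{\mathbf{v}_a}$ stands for the operator

$$\nabla_{\mathbf{v}_a} = \hat{\mathbf{e}}_x\frac{\partial}{\partial\dot{x}_a} + \hat{\mathbf{e}}_y\frac{\partial}{\partial\dot{y}_a} + \hat{\mathbf{e}}_z\frac{\partial}{\partial\dot{z}_a}. \tag{7.1.11}$$

2. $\delta\mathbf{x}_a = \delta\boldsymbol{\theta}\times\mathbf{x}_a$, a fixed rotation about the origin, identical for every
particle, with $L$ invariant. $\delta Q$ is given by

$$\delta Q = \sum_a\nabla_{\mathbf{v}_a}L\cdot\delta\boldsymbol{\theta}\times\mathbf{x}_a \tag{7.1.12}$$

or

$$\delta Q = \delta\boldsymbol{\theta}\cdot\sum_a\mathbf{x}_a\times\nabla_{\mathbf{v}_a}L \tag{7.1.13}$$

so that

$$\mathbf{L} = \sum_a\mathbf{x}_a\times\nabla_{\mathbf{v}_a}L \tag{7.1.14}$$

is conserved. We identify (7.1.14) with the angular momentum.
(Do not confuse $\mathbf{L}$, the angular momentum, with $L$, the Lagrangian.)

3. Suppose $L$ is translation-invariant in $t$; that is, it depends on $t$ only
through its dependence on $q$ and $\dot{q}$. Then with $\delta q_a = \dot{q}_a\,\delta t$ [so that
$q(t) \to q(t + \delta t)$, $\dot{q}(t) \to \dot{q}(t + \delta t)$]

$$\delta L = \sum_a\left(\frac{\partial L}{\partial q_a}\,\dot{q}_a + \frac{\partial L}{\partial\dot{q}_a}\,\ddot{q}_a\right)\delta t = \frac{dL}{dt}\,\delta t$$

and

$$W = \sum_a\frac{\partial L}{\partial\dot{q}_a}\,\dot{q}_a - L \tag{7.1.15}$$

is conserved. We recognize in (7.1.15) the conservation of energy.

4. Our last example is a Galilean transformation to a moving observer:
$\delta x_i = \delta v_i\,t$, with $\delta v_i$ the same for all particles. Then, summing
over particles $a$, we obtain

$$\delta L = \sum_a\frac{\partial L}{\partial x_{ai}}\,\delta v_i\,t + \sum_a\frac{\partial L}{\partial\dot{x}_{ai}}\,\delta v_i = \frac{d}{dt}\left(\sum_a m_ax_{ai}\,\delta v_i\right)$$

for a Lagrangian with translation invariance, whose only velocity
dependence is in its kinetic energy. So, our conservation law here
is

$$\left(\sum_a\frac{\partial L}{\partial\dot{x}_{ai}}\right)t - \sum_a m_ax_{ai} = \text{constant}$$

or

$$\left(\sum_a m_a\dot{x}_{ai}\right)t - \sum_a m_ax_{ai} = \text{constant}. \tag{7.1.16}$$

Equation (7.1.16) tells us that the center of mass moves with
constant velocity.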
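
The conservation laws listed above can be watched in a small simulation. The sketch below is illustrative only and rests on assumptions not in the text: two particles in a plane, a hypothetical spring potential $V = \tfrac{1}{2}K|\mathbf{r}_1 - \mathbf{r}_2|^2$ that depends only on the relative position, and a velocity Verlet integrator. It monitors the total momentum (example 1), the angular momentum (example 2), and the Galilean constant of (7.1.16) (example 4).

```python
K_SPRING = 2.0
MASSES = (1.0, 3.0)


def force_on_first(r1, r2):
    """Force on particle 1 from V = K/2 |r1 - r2|^2; particle 2 feels the opposite force."""
    return [-K_SPRING * (a - b) for a, b in zip(r1, r2)]


def accelerations(r):
    f = force_on_first(r[0], r[1])
    return [[c / MASSES[0] for c in f], [-c / MASSES[1] for c in f]]


def step(r, v, dt):
    """One velocity Verlet step for both particles."""
    acc = accelerations(r)
    r_new = [[r[a][i] + v[a][i] * dt + 0.5 * acc[a][i] * dt * dt for i in range(2)]
             for a in range(2)]
    acc_new = accelerations(r_new)
    v_new = [[v[a][i] + 0.5 * (acc[a][i] + acc_new[a][i]) * dt for i in range(2)]
             for a in range(2)]
    return r_new, v_new


def momentum(v):
    return [sum(MASSES[a] * v[a][i] for a in range(2)) for i in range(2)]


def angular_momentum(r, v):
    return sum(MASSES[a] * (r[a][0] * v[a][1] - r[a][1] * v[a][0]) for a in range(2))


def mass_weighted_position(r):
    return [sum(MASSES[a] * r[a][i] for a in range(2)) for i in range(2)]


r = [[1.0, 0.0], [-0.5, 0.3]]
v = [[0.0, 0.8], [0.2, -0.1]]
p0, l0, x0 = momentum(v), angular_momentum(r, v), mass_weighted_position(r)

dt, steps = 1.0e-3, 5000
for _ in range(steps):
    r, v = step(r, v, dt)
t = dt * steps

p1, l1, x1 = momentum(v), angular_momentum(r, v), mass_weighted_position(r)
# (7.1.16): (sum m v) t - sum m x is constant, equal to -x0 at t = 0
galilean_drift = [p1[i] * t - x1[i] + x0[i] for i in range(2)]
print("momentum change:        ", [b - a for a, b in zip(p0, p1)])
print("angular momentum change:", l1 - l0)
print("drift in (7.1.16):      ", galilean_drift)
```

All three quantities stay fixed to rounding error, because the discrete update preserves the same symmetries (equal and opposite central forces) that Noether's theorem uses in the continuum.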


7.2. RELATIVISTIC LAGRANGIAN FOR 
PARTICLES IN A FIELD 


Relativistic invariance may be achieved by choosing an action that is itself
invariant. The obvious choice for a relativistic free particle action is the
integrated proper time interval for the particle:

$$S_0 = -M\int_{\tau_1}^{\tau_2}d\tau = -M\int\sqrt{dx^\mu\,dx_\mu} = -M\int_{\sigma_1}^{\sigma_2}d\sigma\,\sqrt{\frac{dx^\mu}{d\sigma}\,\frac{dx_\mu}{d\sigma}} \tag{7.2.1}$$

where $M$ is a constant with the dimensions of mass and $\sigma$ any increasing
function of $\tau$. It is important to distinguish between dynamical variables
(the $x^\mu$ here) and Lagrangian integration variables (the $\sigma$ here). Notice
that $x^0$, the physical time, is here regarded as a dynamical variable. Since
the final result is required to give us $\mathbf{x}$ as a function of $x^0$, our equations
must make this possible. We shall see that this is always the case. The
invariance (with respect to the choice of $\sigma$) is called reparametrization
invariance.

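In order to include an electromagnetic interaction, the simplest choice
is to add to $S$ the invariant

$$S_{\text{e.m.}} = E\int dx^\mu A_\mu = E\int_{\sigma_1}^{\sigma_2}\frac{dx^\mu}{d\sigma}\,A_\mu\,d\sigma \tag{7.2.2}$$

where $E$ is a constant with the dimensions of charge.
Note that (7.2.2) is gauge-invariant: A gauge change,

$$A_\mu \to A_\mu + \partial_\mu\Lambda(x),$$

changes $S$ by

$$\Delta S = E\int_{\sigma_1}^{\sigma_2}\frac{\partial\Lambda}{\partial x^\mu}\,\frac{dx^\mu}{d\sigma}\,d\sigma = E\int d\Lambda = E\left[\Lambda(2) - \Lambda(1)\right]$$

so that a variation of $S$ keeping the end points $x^\mu(\sigma_1)$ and $x^\mu(\sigma_2)$ fixed
is unchanged by the change of gauge.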
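The Lagrange equations, with $\sigma$ as the independent variable, are

$$\frac{d}{d\sigma}\left(\frac{\partial L}{\partial(dx^\mu/d\sigma)}\right) = \frac{\partial L}{\partial x^\mu} \tag{7.2.3}$$

where

$$L = -M\sqrt{\frac{dx^\lambda}{d\sigma}\,\frac{dx_\lambda}{d\sigma}} + E\,\frac{dx^\lambda}{d\sigma}\,A_\lambda \tag{7.2.4}$$

so that

$$\frac{d}{d\sigma}\left[-M\,\frac{dx_\mu/d\sigma}{\sqrt{\dfrac{dx^\lambda}{d\sigma}\dfrac{dx_\lambda}{d\sigma}}} + EA_\mu\right] = E\,\frac{dx^\lambda}{d\sigma}\,\frac{\partial A_\lambda}{\partial x^\mu} \tag{7.2.5}$$

since $A_\mu(x)$ is evaluated at $x = x(\sigma)$; $\partial A_\lambda/\partial x^\mu$ is the space-time derivative
of the field evaluated at $x = x(\sigma)$. Carrying out the $\sigma$ differentiation of
$A_\mu$, we have

$$-M\,\frac{d}{d\sigma}\left[\frac{dx_\mu/d\sigma}{\sqrt{\dfrac{dx^\lambda}{d\sigma}\dfrac{dx_\lambda}{d\sigma}}}\right] = E\left(\frac{\partial A_\lambda}{\partial x^\mu} - \frac{\partial A_\mu}{\partial x^\lambda}\right)\frac{dx^\lambda}{d\sigma} = E\,F_{\mu\lambda}\,\frac{dx^\lambda}{d\sigma}. \tag{7.2.6}$$

There are two simple choices for $d\sigma$: either $d\sigma^2 = dx^\mu\,dx_\mu = d\tau^2$, or $d\sigma = dx_0$.
Either yields the equation

$$M\,\frac{d}{d\tau}\left(\frac{dx_\mu}{d\tau}\right) = -E\,F_{\mu\lambda}\,\frac{dx^\lambda}{d\tau} \tag{7.2.7}$$

so that (7.2.6) agrees with (6.5.1), provided we set $M = m$, the particle
mass, and $E = q$, the particle charge.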

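There is an alternative action, also Lorentz-invariant, that gives the
same final equations for a particle in a given electromagnetic field:

$$S = \int d\sigma\left[-\frac{m}{2}\,\frac{dx^\mu}{d\sigma}\,\frac{dx_\mu}{d\sigma} + q\,\frac{dx^\mu}{d\sigma}\,A_\mu\right],$$

corresponding to a Lagrangian

$$L = -\frac{m}{2}\,\frac{dx^\mu}{d\sigma}\,\frac{dx_\mu}{d\sigma} + q\,\frac{dx^\mu}{d\sigma}\,A_\mu. \tag{7.2.8}$$

The Lagrange equations are

$$\frac{d}{d\sigma}\left(m\,\frac{dx_\mu}{d\sigma} - qA_\mu\right) = -q\,\frac{dx^\lambda}{d\sigma}\,\frac{\partial A_\lambda}{\partial x^\mu} \tag{7.2.9}$$

or

$$m\,\frac{d^2x_\mu}{d\sigma^2} = -q\left(\frac{\partial A_\lambda}{\partial x^\mu} - \frac{\partial A_\mu}{\partial x^\lambda}\right)\frac{dx^\lambda}{d\sigma}. \tag{7.2.10}$$

The action $S$ is not reparametrization-invariant; therefore, $d\sigma$ is not
a free variable, but is determined by (7.2.10). Multiply (7.2.10) by $dx^\mu/d\sigma$
to obtain an integral of the motion:

$$\frac{d}{d\sigma}\left(\frac{dx^\mu}{d\sigma}\,\frac{dx_\mu}{d\sigma}\right) = 0 \tag{7.2.11}$$

so that $d\sigma$ must be a constant multiple of $d\tau$. A change in the constant
multiple can be compensated for by redefining the coefficient $m$. Evidently,
$\sigma = \tau$ gives us the correct equation, as written in (7.2.10).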
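
The integral of the motion can be checked by direct integration. The following minimal sketch assumes constant $\mathbf{E}$ and $\mathbf{B}$ fields (arbitrary illustrative values), the conventions $F_{i0} = E_i$, $F_{ij} = \epsilon_{ijk}B_k$ and signature $(+,-,-,-)$, and a standard fourth-order Runge-Kutta step for $m\,du_\mu/d\sigma = -qF_{\mu\lambda}u^\lambda$ with $u_\mu = dx_\mu/d\sigma$. Starting from $u^\mu u_\mu = 1$ (that is, $\sigma = \tau$), the norm should stay at 1 up to integration error.

```python
import math

ETA = (1.0, -1.0, -1.0, -1.0)   # diagonal metric, signature (+, -, -, -)


def field_tensor(E, B):
    """Covariant F_{mu nu} with F_{i0} = E_i and F_{ij} = eps_{ijk} B_k."""
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    return [
        [0.0, -Ex, -Ey, -Ez],
        [Ex, 0.0, Bz, -By],
        [Ey, -Bz, 0.0, Bx],
        [Ez, By, -Bx, 0.0],
    ]


def rhs(u_low, F, q_over_m):
    """d u_mu / d sigma = -(q/m) F_{mu lam} u^lam, with u^lam obtained by raising the index."""
    u_up = [ETA[n] * u_low[n] for n in range(4)]
    return [-q_over_m * sum(F[m][n] * u_up[n] for n in range(4)) for m in range(4)]


def rk4_step(u, F, q_over_m, h):
    k1 = rhs(u, F, q_over_m)
    k2 = rhs([a + 0.5 * h * b for a, b in zip(u, k1)], F, q_over_m)
    k3 = rhs([a + 0.5 * h * b for a, b in zip(u, k2)], F, q_over_m)
    k4 = rhs([a + h * b for a, b in zip(u, k3)], F, q_over_m)
    return [a + h * (b + 2.0 * c + 2.0 * d + e) / 6.0
            for a, b, c, d, e in zip(u, k1, k2, k3, k4)]


def norm2(u_low):
    """u^mu u_mu for a covariant vector u_mu."""
    return sum(ETA[n] * u_low[n] * u_low[n] for n in range(4))


F = field_tensor((0.3, 0.0, 0.1), (0.0, 0.5, 1.0))
v = (0.2, -0.4, 0.1)
gamma = 1.0 / math.sqrt(1.0 - sum(c * c for c in v))
u = [gamma, -gamma * v[0], -gamma * v[1], -gamma * v[2]]   # covariant components u_mu

norm_start = norm2(u)
for _ in range(5000):
    u = rk4_step(u, F, 1.0, 1.0e-3)
norm_end = norm2(u)
print("u.u at start:", norm_start)
print("u.u at end:  ", norm_end)
```

The conservation rests only on the antisymmetry of $F_{\mu\lambda}$, so it holds for any choice of the field values.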

Note that the "energy" constant that we would obtain from the new
action is

$$\frac{\partial L}{\partial(dx^\mu/d\sigma)}\,\frac{dx^\mu}{d\sigma} - L = -\frac{m}{2}\,\frac{dx^\mu}{d\sigma}\,\frac{dx_\mu}{d\sigma}, \tag{7.2.12}$$

just the integral we found above in (7.2.11).

There is still, however, Noether's theorem that may be applied to
time displacement to obtain a physical energy integral. If the Lagrangian
$L$ is invariant under $x^0 \to x^0 + \delta$ ($\delta$ a constant), then there will be an
integral of the motion. For $L$ to have this invariance, $A_\mu$ must be independent
of $x^0$. The constant of the motion is then

$$W = \frac{\delta Q}{\delta} = \frac{\partial L}{\partial(dx^0/d\sigma)}. \tag{7.2.13}$$

$L$ depends on $dx^0/d\sigma$ as

$$L = -\frac{m}{2}\left(\frac{dx^0}{d\sigma}\right)^2 + q\,\frac{dx^0}{d\sigma}\,A_0 + \text{terms independent of } \frac{dx^0}{d\sigma} \tag{7.2.14}$$

so

$$W = \frac{\delta Q}{\delta} = -\left(m\,\frac{dx^0}{d\sigma} - qA_0\right)$$

and with $d\sigma = d\tau$

$$W = -\left(\frac{m}{\sqrt{1-v^2}} + q\phi\right), \tag{7.2.15}$$

in agreement with our earlier formula (6.5.5), as it must be, since the
equations of motion are the same. A technical point: (7.2.15) is a constant
of the motion, but it is the negative of the usual energy $T + V$. For a
particle in an external field, the sign is irrelevant. For a system of
interacting fields and particles, the energy must be obtained from the full
Lagrangian (or better, full action) for the particles and fields. We will
discuss this problem in Section 7.4.

Although the Lagrangians (7.2.4) and (7.2.8) for a particle in an 
electromagnetic field lead to the same equations of motion, this is a special 
circumstance and does not necessarily hold for other interactions. In parti- 
cular, adding a linear gravitational interaction to (7.2.8) leads to a 
simple —and experimentally correct—linear theory, and, as we shall see, 
to Einstein’s equations for the gravitational field; following a similar proce- 
dure with (7.2.4) does not. From here on, we will work only with exten- 
sions of (7.2.8). 

We wish here to complete the list of Noether currents for the case of 
a Lorentz-invariant particle Lagrangian. Thus, consider an action 


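$$S = \int d\sigma\,L\!\left(x^\mu, \frac{dx^\mu}{d\sigma}\right), \tag{7.2.16}$$

which is invariant under the Lorentz transformation

$$x^\mu \to x'^\mu = \Lambda^\mu{}_\nu\,x^\nu \tag{7.2.17}$$

and

$$\frac{dx^\mu}{d\sigma} \to \frac{dx'^\mu}{d\sigma} = \Lambda^\mu{}_\nu\,\frac{dx^\nu}{d\sigma}; \tag{7.2.18}$$

with $d\sigma$ an invariant, $L$ itself will be invariant. Thus, the Noether theorem
will give us the conservation (with respect to $\sigma$) of

$$\delta Q = \frac{\partial L}{\partial(dx^\mu/d\sigma)}\,\delta x^\mu, \tag{7.2.19}$$

where $\delta x^\mu$ is the change $x'^\mu - x^\mu$ for an infinitesimal Lorentz
transformation. We take as an example the "new" Lagrangian

$$L = -\frac{m}{2}\frac{dx^\mu}{d\sigma}\frac{dx_\mu}{d\sigma}. \tag{7.2.20}$$

Thus, $\delta Q$ is

$$\delta Q = -m\,\eta_{\mu\nu}\frac{dx^\nu}{d\sigma}\,\delta x^\mu. \tag{7.2.21}$$

The combination $\eta_{\mu\nu}\delta x^\mu$ is particularly simple. Since $\delta x^\mu =
(\Lambda^\mu{}_\rho - \delta^\mu_\rho)x^\rho$, we learn from (6.2.31) that

$$\eta_{\nu\mu}\,\delta x^\mu = \eta_{\nu\mu}(\Lambda^\mu{}_\rho - \delta^\mu_\rho)\,x^\rho \tag{7.2.22}$$
$$= \epsilon_{\nu\rho}\,x^\rho. \tag{7.2.23}$$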


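Returning to (7.2.21), we obtain

$$\delta Q = -m\,\frac{dx^\nu}{d\sigma}\,\epsilon_{\nu\rho}\,x^\rho, \tag{7.2.24}$$

so we have six tensor constants of the motion:

$$L^{\mu\nu} = m\left(x^\mu\frac{dx^\nu}{d\sigma} - x^\nu\frac{dx^\mu}{d\sigma}\right), \tag{7.2.25}$$

where, following (7.2.11), $d\sigma$ must be taken to be proportional to $d\tau$, the
proper time.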
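
As a small illustration of (7.2.25), the sketch below evaluates $L^{\mu\nu}$ at two points on a free-particle world line $x^\mu(\sigma) = x^\mu(0) + u^\mu\sigma$ and compares the results. The helper names, the mass, and the initial data are hypothetical values chosen for the example, not taken from the text.

```python
def free_worldline(x0_up, u_up, sigma):
    """Position x^mu(sigma) of a free particle with constant velocity u^mu."""
    return [x0_up[i] + u_up[i] * sigma for i in range(4)]

def angular_momentum_tensor(m, x_up, u_up):
    """L^{mu nu} = m (x^mu u^nu - x^nu u^mu), with u^mu = dx^mu/d sigma."""
    return [[m * (x_up[a] * u_up[b] - x_up[b] * u_up[a]) for b in range(4)]
            for a in range(4)]

M = 2.0                      # hypothetical mass
X0 = [0.0, 1.0, -2.0, 0.5]   # hypothetical starting point x^mu(0)
U = [1.25, 0.75, 0.0, 0.0]   # u^mu with u^mu u_mu = 1 in the (+,-,-,-) metric

L_EARLY = angular_momentum_tensor(M, free_worldline(X0, U, 0.0), U)
L_LATE = angular_momentum_tensor(M, free_worldline(X0, U, 3.7), U)
```

The two tensors agree entry by entry: the term proportional to $\sigma$ is $m\sigma(u^\mu u^\nu - u^\nu u^\mu)$, which vanishes identically.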

In this trivial free particle case, we recognize in $L^{ij}$ the three
components of angular momentum, and in $L^{i0}$ = constant the three equations
of motion, $x^i - (dx^i/dx^0)\,x^0$ = constant. The interest of the expression
(7.2.25) is that in the case of interacting fields and particles, (7.2.25) will
again emerge as the particle contribution to a general conservation law.
For an example of this, see Problem 3.3.


7.3. LAGRANGIAN FOR FIELDS


Fields are systems with an infinite number of degrees of freedom: the
fields and field time derivatives at every point in space at a given time.
The Lagrangian is then itself a functional, an integral over space of some
function $\mathcal{L}$ of the fields and their derivatives. The function $\mathcal{L}$ is
sometimes called the Lagrangian, although it is more appropriately called the
Lagrangian density. Thus, we have

$$L = \int d^3x\,\mathcal{L}(\psi_a(x), \partial_\mu\psi_a(x)), \tag{7.3.1}$$

where the $\psi_a$'s are all the fields we are considering and the $\partial_\mu\psi_a$'s their
derivatives. The index $a$ can refer to a species of field, or to a vector or
tensor component. The action $S$ is now

$$S = \int dx^0\,L = \int d^4x\,\mathcal{L}. \tag{7.3.2}$$

Since the four-dimensional volume element $d^4x$ is Lorentz-invariant, the
action (and hence the equations of motion) will be invariant if $\mathcal{L}$ itself is
an invariant function of the fields and their derivatives.

Again, here we must distinguish between dynamical variables (the
fields at every point of space) and the Lagrangian integration variable $x^\mu$
(the time and space point at which one asks for the value of the field). In
order to avoid confusion, we will from now on call the dynamical space-time
coordinates of the particles $y_p^\mu$, or sometimes $y^\mu$. We will keep the
variables $x^\mu$ for the arguments of the fields.

By analogy with ordinary Lagrangians, we know that quadratic
functions of the fields in $\mathcal{L}$ will lead to linear equations; higher derivatives
than the first will lead to higher than second-order differential equations.
We confine ourselves here to first derivatives and second-order equations.
Higher derivatives are suggested in Problem 7.6. The rule for using the
action is the same as for the point Lagrangian: The action must be stationary
for arbitrary variations $\delta\psi_a$ that vanish on the integration boundary.
So,


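$$\delta S = \sum_a \int d^4x\left[\frac{\partial\mathcal{L}}{\partial\psi_a}\,\delta\psi_a + \frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi_a)}\,\delta(\partial_\mu\psi_a)\right]. \tag{7.3.3}$$

Integrating by parts and dropping the boundary term, we have

$$\delta S = \sum_a \int d^4x\,\delta\psi_a\left[\frac{\partial\mathcal{L}}{\partial\psi_a} - \partial_\mu\frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi_a)}\right] = 0 \tag{7.3.4}$$

or, since $\delta\psi_a$ is arbitrary in the interior,

$$\frac{\delta S}{\delta\psi_a} = \frac{\partial\mathcal{L}}{\partial\psi_a} - \frac{\partial}{\partial x^\mu}\frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi_a)} = 0, \tag{7.3.5}$$

the Lagrange equations for the fields. The definition of functional
derivatives has been extended in (7.3.5) to apply to fields.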

The Lagrange equations (7.3.5) permit the construction of a con- 
served stress-energy tensor, called canonical: 


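$$\Theta^{\mu\nu} = \sum_a \frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi_a)}\,\partial^\nu\psi_a - \eta^{\mu\nu}\mathcal{L}, \tag{7.3.6}$$

for which

$$\partial_\mu\Theta^{\mu\nu} = \sum_a\left[\left(\partial_\mu\frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi_a)}\right)\partial^\nu\psi_a + \frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi_a)}\,\partial_\mu\partial^\nu\psi_a\right] - \partial^\nu\mathcal{L}.$$

Using (7.3.5) yields

$$\partial_\mu\Theta^{\mu\nu} = \sum_a\left[\frac{\partial\mathcal{L}}{\partial\psi_a}\,\partial^\nu\psi_a + \frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi_a)}\,\partial^\nu\partial_\mu\psi_a\right] - \partial^\nu\mathcal{L} = -[\partial^\nu]\mathcal{L}, \tag{7.3.7}$$

where the notation $[\partial^\nu]\mathcal{L}$ means that the derivative is only on explicit
space-time dependence of $\mathcal{L}$. If $[\partial^\nu]\mathcal{L} = 0$, $\partial_\mu\Theta^{\mu\nu} = 0$, and we have a
conserved tensor.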

We note here that the conservation of (7.3.6) is a special case of 
the Noether theorem applied to field Lagrangians. Thus, assume that an 
infinitesimal transformation 


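$$\psi_a \to \psi_a + \delta\psi_a \tag{7.3.8}$$

leads to a change in $\mathcal{L}$ of

$$\delta\mathcal{L} = \partial_\mu(\delta\Lambda^\mu) \tag{7.3.9}$$

for some $\delta\Lambda^\mu$. It follows that

$$\partial_\mu\left(\sum_a \frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi_a)}\,\delta\psi_a - \delta\Lambda^\mu\right) = 0, \tag{7.3.10}$$

since

$$\delta\mathcal{L} = \sum_a\left[\frac{\partial\mathcal{L}}{\partial\psi_a}\,\delta\psi_a + \frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi_a)}\,\partial_\mu\delta\psi_a\right] = \partial_\mu\left(\sum_a \frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi_a)}\,\delta\psi_a\right). \tag{7.3.11}$$

Combining (7.3.11) and (7.3.9) yields the conservation law (7.3.10). The
vector in (7.3.10) that satisfies the conservation law is called the Noether
current.

Equation (7.3.6) for $\Theta^{\mu\nu}$ follows by setting $\delta\psi_a = \partial_\lambda\psi_a\,\delta x^\lambda$, where
$\delta x^\lambda$ is a constant increment in $x^\lambda$ and $\delta\Lambda^\mu$ is $\mathcal{L}\,\delta x^\mu$. This will be discussed
more fully in Section 7.4.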
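
The translation case can be written out step by step. The lines below are a sketch that assumes $\mathcal{L}$ has no explicit dependence on $x$, so that its change under a constant shift is a pure divergence; the index names are chosen here for the illustration.

```latex
\begin{aligned}
\delta\psi_a &= \partial_\lambda\psi_a\,\delta x^\lambda, \qquad
\delta\mathcal{L} = \partial_\lambda\mathcal{L}\,\delta x^\lambda
  = \partial_\mu\!\left(\mathcal{L}\,\delta x^\mu\right)
  \;\Rightarrow\; \delta\Lambda^\mu = \mathcal{L}\,\delta x^\mu, \\
0 &= \partial_\mu\!\left(\sum_a \frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi_a)}\,
      \partial_\lambda\psi_a\,\delta x^\lambda - \mathcal{L}\,\delta x^\mu\right)
   = \delta x^\lambda\,\partial_\mu\!\left(\sum_a \frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi_a)}\,
      \partial_\lambda\psi_a - \delta^\mu_\lambda\,\mathcal{L}\right).
\end{aligned}
```

Since $\delta x^\lambda$ is arbitrary, the bracket is conserved for each $\lambda$; raising $\lambda$ with $\eta^{\lambda\nu}$ gives the tensor $\Theta^{\mu\nu}$ of (7.3.6).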

The conservation of $\Theta^{\mu\nu}$ makes possible the definition of a conserved
four-vector that we identify with the field energy-momentum:

$$P^\mu = \int d^3x\,\Theta^{0\mu}, \tag{7.3.12}$$

such that

$$\frac{dP^\mu}{dx^0} = 0. \tag{7.3.13}$$

The tensor $\Theta^{\mu\nu}$ is not symmetric, except for scalar fields. If we could
find a tensor $T^{\mu\nu}$ that was both conserved and symmetric (and we can),
we could, as discussed in Section 6.6, define a tensor field

$$M^{\mu\nu\lambda} = x^\nu T^{\mu\lambda} - x^\lambda T^{\mu\nu}, \tag{7.3.14}$$

which would then be locally conserved:

$$\partial_\mu M^{\mu\nu\lambda} = 0, \tag{7.3.15}$$

leading to the global conservation of the angular momentum tensor

$$L^{\nu\lambda} = \int d^3x\,M^{0\nu\lambda} = \int d^3x\,(x^\nu T^{0\lambda} - x^\lambda T^{0\nu}); \tag{7.3.16}$$

that is,

$$\frac{dL^{\nu\lambda}}{dx^0} = 0. \tag{7.3.17}$$

Equation (7.3.17) shows the general value of finding a symmetric
tensor $T^{\mu\nu}$.

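We next show how a conserved, symmetric stress-energy tensor can
be constructed for a Lorentz-invariant Lagrangian density. Since the
canonical stress tensor $\Theta^{\mu\nu}$ is symmetric only for a scalar theory, this
is a useful procedure. We start from

$$\Theta^{\mu\nu} = \frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi_a)}\,\partial^\nu\psi_a - \eta^{\mu\nu}\mathcal{L} \tag{7.3.18}$$

and add to $\Theta^{\mu\nu}$ a tensor

$$\delta\Theta^{\mu\nu} = \partial_\lambda\Phi^{\lambda\mu\nu}, \tag{7.3.19}$$

where $\Phi^{\lambda\mu\nu}$ is antisymmetric in $\lambda$ and $\mu$, so that

$$T^{\mu\nu} = \Theta^{\mu\nu} + \delta\Theta^{\mu\nu} \tag{7.3.20}$$

is conserved (since $\partial_\mu\partial_\lambda\Phi^{\lambda\mu\nu} = 0$), and $P^\nu = \int d^3x\,T^{0\nu}$ is left unchanged
(since $\int d^3x\,\partial_\lambda\Phi^{\lambda 0\nu} = \int d^3x\,\partial_i\Phi^{i0\nu} = 0$).

In addition, it will be shown that $\Phi^{\lambda\mu\nu}$ may be chosen so that $T^{\mu\nu}$ in
(7.3.20) is symmetric under the exchange of $\mu$ and $\nu$. We proceed by
deriving an identity that follows from the Lorentz invariance of the
Lagrangian density, $\mathcal{L}(\psi_a, \partial_\mu\psi_a)$.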

Under a Lorentz transformation, the fields $\psi_a$ transform so that

$$\psi'_a = S_{ab}\,\psi_b, \tag{7.3.21}$$

where $S_{ab}$ is a matrix representation of the Lorentz group. If the
transformation is infinitesimal, specified by $\epsilon_{\mu\nu}$, as in (7.2.23), then the matrix $S$
will be of the form

$$S = 1 + \epsilon_{\mu\nu}\Sigma^{\mu\nu}, \tag{7.3.22}$$

where each $\Sigma^{\mu\nu}$ is an antisymmetric matrix in the $(a, b)$ space. We give
some examples:

1. $\psi$ a scalar field: $\Sigma = 0$.
2. $\psi$ a spin-one-half field:


yer = ; (7.3.23) 
where $\sigma^{\mu\nu}$ is the set of six relativistic Pauli matrices:

$$\sigma^{\mu\nu} = \frac{\gamma^\mu\gamma^\nu - \gamma^\nu\gamma^\mu}{2i}. \tag{7.3.24}$$


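3. $\psi$ a covariant vector field $\psi_\mu$. Then

$$\delta\psi_\mu = -\epsilon^\lambda{}_\mu\,\psi_\lambda. \tag{7.3.25}$$

On the other hand, from (7.3.22)

$$\delta\psi_\mu = \epsilon_{\alpha\beta}\,\Sigma^{\alpha\beta}{}_\mu{}^\lambda\,\psi_\lambda, \tag{7.3.26}$$

so that

$$\Sigma^{\alpha\beta}{}_\mu{}^\lambda = \tfrac{1}{2}\left(\delta^\alpha_\mu\,\eta^{\beta\lambda} - \delta^\beta_\mu\,\eta^{\alpha\lambda}\right) \tag{7.3.27}$$

correctly transforms $\psi_\mu$.
Similarly, a second-rank covariant tensor field transforms like

$$\delta\psi_{\alpha\beta} = -\epsilon^\lambda{}_\alpha\,\psi_{\lambda\beta} - \epsilon^\lambda{}_\beta\,\psi_{\alpha\lambda}, \tag{7.3.28}$$

so that

$$\delta\psi_{\alpha\beta} = \epsilon_{\mu\nu}\left(\Sigma^{\mu\nu}{}_\alpha{}^\lambda\,\psi_{\lambda\beta} + \Sigma^{\mu\nu}{}_\beta{}^\lambda\,\psi_{\alpha\lambda}\right), \tag{7.3.29}$$

with $\Sigma^{\mu\nu}{}_\alpha{}^\lambda$ given by (7.3.27).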
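The Lorentz invariance of the Lagrangian $\mathcal{L}(\psi, \partial_\mu\psi)$ requires that

$$\frac{\partial\mathcal{L}}{\partial\psi}\,\delta\psi + \frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi)}\,\delta(\partial_\mu\psi) = 0, \tag{7.3.30}$$

where

$$\delta\psi = \epsilon_{\mu\nu}\Sigma^{\mu\nu}\psi. \tag{7.3.31}$$

Note the abbreviated notation: $\delta\psi$ and $\partial\mathcal{L}/\partial\psi$ are vectors in the $(a, b)$
space in which $\Sigma$ is a matrix, as are $\delta(\partial_\mu\psi)$ and $\partial\mathcal{L}/\partial(\partial_\mu\psi)$. Note that if
$\psi$ is covariant, $\partial\mathcal{L}/\partial\psi$ is contravariant and vice-versa.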

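With

$$\frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi)} = p^\mu, \tag{7.3.32}$$

the Lagrange equation applied to (7.3.30) tells us that

$$(\partial_\mu p^\mu)\,\delta\psi + p^\mu\,\delta(\partial_\mu\psi) = 0. \tag{7.3.33}$$

The transformation law (7.3.21) together with (6.2.31) informs us that

$$\delta(\partial_\mu\psi) = \partial_\mu\delta\psi - \epsilon^\lambda{}_\mu\,\partial_\lambda\psi. \tag{7.3.34}$$

This follows from the recognition that $\delta(\partial_\mu\psi)$ is not equal to $\partial_\mu\delta\psi$; the
added Lorentz index $\mu$ must also be transformed. Thus, (7.3.33) becomes

$$\partial_\mu(p^\mu\,\delta\psi) - p^\mu\,\epsilon^\lambda{}_\mu\,\partial_\lambda\psi = 0 \tag{7.3.35}$$

or

$$\partial_\sigma\left(p^\sigma\,\epsilon_{\mu\nu}\Sigma^{\mu\nu}\psi\right) = p^\mu\,\epsilon^\lambda{}_\mu\,\partial_\lambda\psi = \epsilon_{\mu\nu}\,p^\nu\partial^\mu\psi, \tag{7.3.36}$$

so that

$$\frac{p^\nu\partial^\mu\psi - p^\mu\partial^\nu\psi}{2} = \partial_\sigma\left(p^\sigma\Sigma^{\mu\nu}\psi\right). \tag{7.3.37}$$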
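We return now to

$$\Theta^{\mu\nu} = p^\mu\partial^\nu\psi - \eta^{\mu\nu}\mathcal{L} \tag{7.3.38}$$

and note that with

$$\Phi^{\sigma\mu\nu} = \left(p^\sigma\Sigma^{\mu\nu} - p^\mu\Sigma^{\sigma\nu} - p^\nu\Sigma^{\sigma\mu}\right)\psi, \tag{7.3.39}$$

$$T^{\mu\nu} = \Theta^{\mu\nu} + \partial_\sigma\Phi^{\sigma\mu\nu} \tag{7.3.40}$$

has all the desired properties. The divergence of the first term of (7.3.39)
added to $p^\mu\partial^\nu\psi$ produces a $(\mu, \nu)$ symmetric sum. The second and third
terms of (7.3.39) are already $(\mu, \nu)$ symmetric, as is the $\eta^{\mu\nu}$ term in $\Theta^{\mu\nu}$.
Therefore $T^{\mu\nu}$ is $(\mu, \nu)$ symmetric. However, $\Phi^{\sigma\mu\nu}$ is $(\sigma, \mu)$ antisymmetric,
so that the new symmetric $T^{\mu\nu}$ is conserved and leads to the same
conserved energy and momentum as $\Theta^{\mu\nu}$.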

We consider an example: a hypothetical scalar field. 

The simplest consistent Lagrangian density is $\psi^2$, where $\psi$ is the scalar
field. This choice yields the variational equation $\psi = 0$; this is not a very
interesting result. The next complication is to add a derivative. Since $\mathcal{L}$
must be Lorentz-invariant, we can only add

$$\partial_\mu\psi\,\partial^\mu\psi = \frac{\partial\psi}{\partial x^\mu}\frac{\partial\psi}{\partial x_\mu} \tag{7.3.41}$$

to obtain

$$\mathcal{L} = \tfrac{1}{2}\,\partial_\mu\psi\,\partial^\mu\psi - \tfrac{1}{2}\,\mu^2\psi^2. \tag{7.3.42}$$

The arbitrary constant $\tfrac{1}{2}$ determines our units, which here and from now
on are rationalized. This saves a lot of $4\pi$ writing! The constant
$\mu = \sqrt{\mu^2}$ is real and is, as we will see, the minimum frequency at which
the field will oscillate.

To derive the field equations from the Lagrangian density, it is useful 
(only this time) to go back to the more explicit notation 


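$$\partial_\mu\psi\,\partial^\mu\psi = \partial_\mu\psi\,\eta^{\mu\nu}\,\partial_\nu\psi$$

so that

$$\frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi)} = \partial^\mu\psi \tag{7.3.43}$$

and

$$\partial_\mu\frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi)} - \frac{\partial\mathcal{L}}{\partial\psi} = (\partial^\mu\partial_\mu + \mu^2)\,\psi = 0. \tag{7.3.44}$$

We recognize

$$\partial^\mu\partial_\mu = \frac{\partial^2}{(\partial x^0)^2} - \nabla^2.$$

Equation (7.3.44) is, of course, the familiar wave equation with
solutions $e^{\pm ik_\mu x^\mu}$, where

$$-k^\mu k_\mu + \mu^2 = \mu^2 + \mathbf{k}^2 - (k^0)^2 = 0. \tag{7.3.45}$$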
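
The dispersion relation (7.3.45) can be checked numerically. The sketch below is an illustration only: it works in one space dimension, uses a real plane wave $\cos(k^0 t - kx)$, and approximates the second derivatives by central differences. The function names, the step size `h`, and the sample values of `MU` and `K` are assumptions made for the example.

```python
import math

def plane_wave(t, x, k0, k):
    """Real plane wave cos(k0 t - k x) in one space dimension."""
    return math.cos(k0 * t - k * x)

def kg_residual(t, x, k0, k, mu, h=1e-3):
    """Central-difference estimate of (d^2/dt^2 - d^2/dx^2 + mu^2) psi."""
    def psi(tt, xx):
        return plane_wave(tt, xx, k0, k)
    d2t = (psi(t + h, x) - 2.0 * psi(t, x) + psi(t - h, x)) / h ** 2
    d2x = (psi(t, x + h) - 2.0 * psi(t, x) + psi(t, x - h)) / h ** 2
    return d2t - d2x + mu ** 2 * psi(t, x)

MU = 2.0
K = 1.5
K0 = math.sqrt(K ** 2 + MU ** 2)   # frequency fixed by (7.3.45)

ON_SHELL = kg_residual(0.3, 0.7, K0, K, MU)   # close to zero
OFF_SHELL = kg_residual(0.3, 0.7, K, K, MU)   # wrong frequency, not small
```

With $k^0 = \sqrt{k^2 + \mu^2}$ the residual vanishes up to discretization error. At $k = 0$ the frequency reduces to $\mu$, the minimum frequency mentioned after (7.3.42).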


The canonical stress-energy tensor for this theory is 


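$$\Theta^{\mu\nu} = \partial^\mu\psi\,\partial^\nu\psi - \eta^{\mu\nu}\left(\tfrac{1}{2}\,\partial_\lambda\psi\,\partial^\lambda\psi - \tfrac{1}{2}\,\mu^2\psi^2\right), \tag{7.3.46}$$

which is symmetric and, therefore, entitled to be called $T^{\mu\nu}$. The
four-vector $P^\mu$ is given by

$$P^\mu = \int\left[\dot\psi\,\partial^\mu\psi - \eta^{0\mu}\mathcal{L}\right]d^3x \tag{7.3.47}$$

with

$$P^0 = \tfrac{1}{2}\int d^3x\left[\dot\psi^2 + (\nabla\psi)^2 + \mu^2\psi^2\right], \tag{7.3.48}$$

clearly showing the need for positive $\mu^2$, since otherwise $P^0$ would be
unbounded both below and above and the system would be unstable.
The momentum is

$$P^i = \int \dot\psi\,\partial^i\psi\,d^3x \quad\text{or}\quad \mathbf{P} = -\int d^3x\,\dot\psi\,\nabla\psi. \tag{7.3.49}$$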


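The angular momentum density $\mathbf{l}(\mathbf{r})$ is $\mathbf{r} \times \mathbf{p}(\mathbf{r})$, where $\mathbf{p}$ is the momentum
density. That is, the angular momentum $\mathbf{L}_V$ contained in a volume $V$ is

$$\mathbf{L}_V = -\int_V \dot\psi\,\mathbf{r} \times \nabla\psi\,d^3x, \tag{7.3.50}$$

and the angular momentum flux is obtained from the equation

$$\frac{d}{dt}L^{ij} = \int_V d^3x\,\partial_0\left(T^{0j}x^i - T^{0i}x^j\right)$$

$$= -\int_V d^3x\left[(\partial_k T^{kj})\,x^i - (\partial_k T^{ki})\,x^j\right]$$

$$= -\int_S dS_k\left(T^{kj}x^i - T^{ki}x^j\right). \tag{7.3.51}$$

Thus, the loss (radiation) of angular momentum through a surface $S$ is

$$-\frac{dL^{ij}}{dt} = \int_S dS_k\left(T^{kj}x^i - T^{ki}x^j\right). \tag{7.3.52}$$


7.4. INTERACTING FIELDS AND PARTICLES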


Following the work in Section 7.2, we use particle coordinates $y_p^\mu(\sigma_p)$,
with $\sigma$ an invariant parameter to be determined, as, for example, by
(7.2.11). In what follows we drop the label $p$, with the understanding that
we must sum over all participating particles.

The particle Lagrangian $L_p$ is a function of $\dot y_\mu(\sigma) = dy_\mu(\sigma)/d\sigma$, and
the particle action is the functional

$$S_p = \int L_p\,d\sigma = -\frac{m}{2}\int \dot y^\mu(\sigma)\,\dot y_\mu(\sigma)\,d\sigma, \tag{7.4.1}$$

as given by (7.2.8).
The field coordinates $\psi_a(x)$ are functions of space and time. The field
action is the functional

$$S_f = \int d^4x\,\mathcal{L}_f(\psi_a(x), \partial_\mu\psi_a(x)), \tag{7.4.2}$$

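where $\mathcal{L}_f$ is the Lagrangian density. For a scalar field, we have seen in
(7.3.42) that we may take

$$\mathcal{L}_f = \tfrac{1}{2}\,\partial^\mu\psi\,\partial_\mu\psi - \frac{\mu^2}{2}\,\psi^2. \tag{7.4.3}$$

The Lagrangian corresponding to $\mathcal{L}$ is

$$L(t) = \int d^3x\,\mathcal{L}(x), \tag{7.4.4}$$

that is a function of $t$, whereas the particle Lagrangian

$$L_p = -\frac{m}{2}\frac{dy^\mu}{d\sigma}\frac{dy_\mu}{d\sigma}$$

is a function of $\sigma$.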


It is therefore necessary to consider the action as the fundamental 
functional. The action for the scalar field particle system will be 


S=Sy,+S,+S, (7.4.5) 


where S; is the interaction action. We have seen in (7.2.2) how S; must 
be chosen to give the correct equation for charged particle motion in an 
electromagnetic field: 
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d u 
S,= q | do 4,0) <a (7.4.6) 
do 
We see that S$; is also a functional of the field A, (x) by rewriting S;: 
4 dy” .4 
S;=q\d rado D (y — x) A(x), (7.4.7) 
[em 


in which form the field variable A, depends on the field point x, as is 
required by the Lagrangian procedure. 
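In general, we will have

$$S_f = \int d^4x\,\mathcal{L}_f, \tag{7.4.8}$$

$$S_p = \int d\sigma\,L_p, \tag{7.4.9}$$

and $S_I$ with two equivalent forms:

$$S_I = \int d\sigma\,L_I, \tag{7.4.10}$$

as in (7.4.6), or

$$S_I = \int d^4x\,\mathcal{L}_I, \tag{7.4.11}$$

as in (7.4.7), the first form appropriate to the $y^\mu(\sigma)$ equation, the second
to the $\psi(x)$ equation.
The Lagrange equations for the combined $\psi(x)$, $y^\mu(\sigma)$ system are

$$\partial_\mu\frac{\partial(\mathcal{L}_f + \mathcal{L}_I)}{\partial(\partial_\mu\psi)} - \frac{\partial(\mathcal{L}_f + \mathcal{L}_I)}{\partial\psi} = 0 \tag{7.4.12}$$

and

$$\frac{d}{d\sigma}\frac{\partial(L_p + L_I)}{\partial\dot y^\mu} - \frac{\partial(L_p + L_I)}{\partial y^\mu} = 0. \tag{7.4.13}$$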


The interaction action $S_I$ in all the cases we will consider is linear in the
$\psi_a$'s; in first approximation, the field action $S_f$ is bilinear in the $\psi_a$'s and
$\partial_\mu\psi_a$'s. Therefore, the overall sign of $S_I$, which is equivalent to the sign
of the coupling constant, is irrelevant, since it can be changed by a
redefinition of $\psi_a$: $\psi_a \to -\psi_a$. By the same token, the signs of $S_f$ and $S_p$ alone
are irrelevant. The important sign is the relative sign of $S_f$ and $S_p$ and,
if there are several independent $S_f$'s and $S_p$'s, corresponding to several
fields and particles, their relative sign. We choose the relative signs so
that the energy of the noninteracting system has a definite sign, which we
normally take to be positive.

If the equations of motion are translation-invariant in space and time, 
we can apply Noether’s theorem to find the conserved energy-momentum 
four-vector. The translational invariance is with respect to the transforma- 
tion 


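$$y^\mu \to y^\mu + \delta y^\mu \tag{7.4.14}$$

with $\delta y^\mu$ a constant four-vector, and

$$\psi_a(x) \to \psi_a(x - \delta y) \tag{7.4.15}$$

or, for an infinitesimal transformation,

$$\psi_a(x) \to \psi_a(x) - \delta y^\mu\,\partial_\mu\psi_a(x). \tag{7.4.16}$$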


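As in (7.3.9) and (7.3.11), we calculate $\delta S$ in two ways. First, using
the Lagrange equations, we get

$$\delta S = \int d^4x\,\partial_\mu\!\left[\frac{\partial(\mathcal{L}_f + \mathcal{L}_I)}{\partial(\partial_\mu\psi_a)}\,\delta\psi_a\right] + \int d\sigma\,\frac{d}{d\sigma}\!\left[\frac{\partial(L_p + L_I)}{\partial\dot y^\mu}\,\delta y^\mu\right]. \tag{7.4.17}$$

In the $d^4x$ integration, we can carry out the time integral from $t_1$ to $t_2$;
the space components of $\partial_\mu$ integrate to zero. Similarly, we carry out the
$d\sigma$ integral from $\sigma_1$ to $\sigma_2$, where $\sigma_i$ is such that $y^0(\sigma_1) = t_1$ and $y^0(\sigma_2) = t_2$.
$\delta S$ becomes

$$\delta S = \left[\int d^3x\,\frac{\partial\mathcal{L}_f}{\partial(\partial_0\psi_a)}\,\delta\psi_a + \frac{\partial(L_p + L_I)}{\partial\dot y^\mu}\,\delta y^\mu\right]_1^2, \tag{7.4.18}$$

since $\partial\mathcal{L}_I/\partial(\partial_0\psi_a) = 0$.
Second, remembering that $\delta y^\mu$ is constant and, as in (7.4.1) and
(7.4.6), that $S_I$ and $S_p$ are invariant under the transformation, we find, by
a direct calculation,

$$\delta S = -\delta y^\mu\int d^4x\,\partial_\mu\mathcal{L}_f = -\delta y^0\left[\int d^3x\,\mathcal{L}_f\right]_1^2. \tag{7.4.19}$$

Equating (7.4.18) to (7.4.19), we find with $\delta\psi_a = -\delta y^\nu\partial_\nu\psi_a$

$$-\delta y^\nu\left[\int d^3x\left(\frac{\partial\mathcal{L}_f}{\partial(\partial_0\psi_a)}\,\partial_\nu\psi_a - \delta^0_\nu\,\mathcal{L}_f\right) - \frac{\partial(L_p + L_I)}{\partial\dot y^\nu}\right]_1^2 = -\delta y^\nu\,P_\nu\Big|_1^2 = 0, \tag{7.4.20}$$

with $P_\nu$ a constant:

$$P_\nu = \int d^3x\left[\frac{\partial\mathcal{L}_f}{\partial(\partial_0\psi_a)}\,\partial_\nu\psi_a - \delta^0_\nu\,\mathcal{L}_f\right] - \frac{\partial}{\partial\dot y^\nu}(L_p + L_I). \tag{7.4.21}$$

Equation (7.4.21) gives us a rule for extending the field theoretic
canonical formalism of (7.3.6) to include particle degrees of freedom, the
$y^\mu(\sigma)$.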

We return to the example of the scalar field, considered earlier in 
Section 7.3, but now interacting. We choose the action (see Problems 7.1 
and 7.2) 


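$$S = \int d^4x\left[\tfrac{1}{2}\,\partial_\mu\psi\,\partial^\mu\psi - \frac{\mu^2}{2}\,\psi^2\right] - \frac{m}{2}\int d\sigma\,\frac{dy^\mu}{d\sigma}\frac{dy_\mu}{d\sigma}\,\bigl(1 + g\psi(y)\bigr). \tag{7.4.22}$$

The equations of motion are

$$-\partial_\mu\partial^\mu\psi - \mu^2\psi - \frac{mg}{2}\int d\sigma\,\frac{dy^\lambda}{d\sigma}\frac{dy_\lambda}{d\sigma}\,\delta^4(x - y) = 0 \tag{7.4.23}$$

and

$$m\frac{d}{d\sigma}\left(\frac{dy_\mu}{d\sigma}\bigl(1 + g\psi(y)\bigr)\right) = \frac{mg}{2}\,\frac{dy^\lambda}{d\sigma}\frac{dy_\lambda}{d\sigma}\,\frac{\partial\psi(y)}{\partial y^\mu}. \tag{7.4.24}$$

The "$W$" conservation law is for

$$\frac{dy^\lambda}{d\sigma}\frac{dy_\lambda}{d\sigma}\,\bigl(1 + g\psi(y)\bigr) = \left(\frac{d\tau}{d\sigma}\right)^2\bigl(1 + g\psi(y)\bigr), \tag{7.4.25}$$

which we may set equal to 1. The energy momentum vector $P_\nu$ is

$$P_\nu = \int d^3x\left\{\frac{\partial\psi}{\partial x^0}\frac{\partial\psi}{\partial x^\nu} - \delta^0_\nu\left(\tfrac{1}{2}\,\partial_\lambda\psi\,\partial^\lambda\psi - \frac{\mu^2}{2}\,\psi^2\right)\right\} + m\,\frac{dy_\nu}{d\sigma}\,\bigl(1 + g\psi(y)\bigr). \tag{7.4.26}$$

Note the positive energy

$$P_0 = \int d^3x\,\frac{\dot\psi^2 + (\nabla\psi)^2 + \mu^2\psi^2}{2} + \frac{m\,\bigl(1 + g\psi(y)\bigr)^{1/2}}{\sqrt{1 - v^2}}, \tag{7.4.27}$$

provided the expression $(d\tau/d\sigma)^2(1 + g\psi) = 1$ behaves properly.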

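If we return to (7.4.21), an obvious question arises next: Is there a
conserved tensor $\Theta^\mu{}_\nu$, of which $P_\nu$ is the space integral of the $0\nu$ component?
The answer is yes. It can be written down by inspection. It is

$$\Theta^\mu{}_\nu = \frac{\partial\mathcal{L}_f}{\partial(\partial_\mu\psi_a)}\,\partial_\nu\psi_a - \delta^\mu_\nu\,\mathcal{L}_f - \int \dot y^\mu\,\frac{\partial}{\partial\dot y^\nu}(L_p + L_I)\,\delta^4(x - y)\,d\sigma. \tag{7.4.28}$$

It is left as Problem 7.4 to show that $\Theta^\mu{}_\nu$ has the desired properties.
The totally contravariant form of $\Theta^\mu{}_\nu$,

$$\Theta^{\mu\nu} = \frac{\partial\mathcal{L}_f}{\partial(\partial_\mu\psi_a)}\,\partial^\nu\psi_a - \eta^{\mu\nu}\mathcal{L}_f - \int \dot y^\mu\,\frac{\partial}{\partial\dot y_\nu}(L_p + L_I)\,\delta^4(x - y)\,d\sigma, \tag{7.4.29}$$

is, in general, not symmetric.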
The next question: Can we always find a tensor $\delta\Theta^{\mu\nu}$ such that

$$T^{\mu\nu} = \Theta^{\mu\nu} + \delta\Theta^{\mu\nu} \tag{7.4.30}$$

has the desired properties listed in Section 7.3? That is, is it symmetric,
conserved, and having the same $P^\mu$ integral as $\Theta^{\mu\nu}$? Again, the answer is
yes. The construction technique parallels the treatment leading to (7.3.40).

We make use of the fact that $\mathcal{L}_f$ and $L = L_p + L_I$ are separately
invariant under the Lorentz transformation $\psi \to \psi + \epsilon_{\mu\nu}\Sigma^{\mu\nu}\psi$, as in
(7.3.22), and

$$y^\mu \to y^\mu + \epsilon^\mu{}_\lambda\,y^\lambda. \tag{7.4.31}$$

From the invariance of $\mathcal{L}_f$ follows


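$$\frac{\partial\mathcal{L}_f}{\partial\psi}\,\delta\psi + p^\mu\,\delta(\partial_\mu\psi) = 0, \tag{7.4.32}$$

where $p^\mu$ is still given by (7.3.32). From the Lagrange equation for $\psi$, we
learn that

$$\frac{\partial\mathcal{L}_f}{\partial\psi} + \int d\sigma\,\frac{\delta L}{\delta\psi(x)} - \partial_\mu p^\mu = 0, \tag{7.4.33}$$

so that, with $\delta(\partial_\mu\psi) = \partial_\mu\delta\psi - \epsilon^\lambda{}_\mu\,\partial_\lambda\psi$, (7.4.32) becomes

$$(\partial_\mu p^\mu)\,\delta\psi - \int d\sigma\,\frac{\delta L}{\delta\psi(x)}\,\delta\psi + p^\mu\,\partial_\mu\delta\psi - p^\mu\,\epsilon^\lambda{}_\mu\,\partial_\lambda\psi = 0$$

or

$$\partial_\mu(p^\mu\,\delta\psi) = \int d\sigma\,\frac{\delta L}{\delta\psi(x)}\,\delta\psi + p^\mu\,\epsilon^\lambda{}_\mu\,\partial_\lambda\psi, \tag{7.4.34}$$

and from (7.3.22)

$$\epsilon_{\mu\nu}\,\partial_\sigma\left(p^\sigma\Sigma^{\mu\nu}\psi\right) = \epsilon_{\mu\nu}\left[\int d\sigma\,\frac{\delta L}{\delta\psi(x)}\,\Sigma^{\mu\nu}\psi + \frac{p^\nu\partial^\mu\psi - p^\mu\partial^\nu\psi}{2}\right]$$

so that

$$\partial_\sigma\left(p^\sigma\Sigma^{\mu\nu}\psi\right) = \int d\sigma\,\frac{\delta L}{\delta\psi(x)}\,\Sigma^{\mu\nu}\psi + \frac{p^\nu\partial^\mu\psi - p^\mu\partial^\nu\psi}{2}. \tag{7.4.35}$$

We define $\Phi^{\sigma\mu\nu}$ as in (7.3.39) and add $\delta\Theta^{\mu\nu}$ to $\Theta^{\mu\nu}$, where

$$\delta\Theta^{\mu\nu} = \partial_\sigma\Phi^{\sigma\mu\nu} \tag{7.4.36}$$

and

$$T^{\mu\nu} = \Theta^{\mu\nu} + \delta\Theta^{\mu\nu}. \tag{7.4.37}$$

The symmetry properties of $\Phi^{\sigma\mu\nu}$ show that $\delta\Theta^{\mu\nu}$ is conserved:

$$\partial_\mu\,\delta\Theta^{\mu\nu} = 0, \tag{7.4.38}$$

and the energy-momentum vector $P^\nu$ is unchanged by $\delta\Theta^{\mu\nu}$:

$$\int d^3x\,\delta\Theta^{0\nu} = 0. \tag{7.4.39}$$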


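It must next be shown that $T^{\mu\nu}$ is symmetric. We see that the last term
on the right of (7.4.35), $(p^\nu\partial^\mu\psi - p^\mu\partial^\nu\psi)/2$, added to the first term of
$\Theta^{\mu\nu}$ in (7.4.29), $p^\mu\partial^\nu\psi$, is symmetric.
There remains

$$\delta T^{\mu\nu} = -\int \dot y^\mu\,\frac{\partial L}{\partial\dot y_\nu}\,\delta^4(x - y)\,d\sigma + \int \frac{\delta L}{\delta\psi(x)}\,\Sigma^{\mu\nu}\psi(x)\,d\sigma. \tag{7.4.40}$$

From the invariance of $L$, we find

$$\epsilon_{\mu\nu}\,\frac{\partial L}{\partial\psi(y)}\,\Sigma^{\mu\nu}\psi(y) + \frac{\partial L}{\partial\dot y^\mu}\,\epsilon^\mu{}_\nu\,\dot y^\nu = 0 \tag{7.4.41}$$

or

$$\frac{\partial L}{\partial\psi(y)}\,\Sigma^{\mu\nu}\psi(y) + \frac{1}{2}\left(\dot y^\nu\frac{\partial L}{\partial\dot y_\mu} - \dot y^\mu\frac{\partial L}{\partial\dot y_\nu}\right) = 0. \tag{7.4.42}$$

The variational derivative $\delta L/\delta\psi(x)$ is given by

$$\frac{\delta L}{\delta\psi(x)} = \frac{\partial L}{\partial\psi(y)}\,\delta^4(x - y), \tag{7.4.43}$$

so that

$$\delta T^{\mu\nu} = -\int \dot y^\mu\,\frac{\partial L}{\partial\dot y_\nu}\,\delta^4(x - y)\,d\sigma - \frac{1}{2}\int d\sigma\,\delta^4(x - y)\left(\dot y^\nu\frac{\partial L}{\partial\dot y_\mu} - \dot y^\mu\frac{\partial L}{\partial\dot y_\nu}\right),$$

which is $\mu\nu$ symmetric. This completes the construction of $T^{\mu\nu}$.
In Section 7.8, we will find a second method of constructing a conserved,
symmetric stress tensor directly from an action principle.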


7.5. VECTOR FIELDS 


We return now to electrodynamics. We first ask for a Lagrangian density
for a free massive vector field $A_\mu$, analogous to the vector potential of
electrodynamics. We require an invariant bilinear function of $A_\mu$, involving
at most first derivatives. A first guess might be $\mathcal{L}_G$ (G for a guess):

$$\mathcal{L}_G = -\partial_\mu A^\nu\,\partial^\mu A_\nu + \mu^2 A^\nu A_\nu. \tag{7.5.1}$$

This choice has an obvious problem: If we consider each $\nu$ value separately,
we have in (7.5.1) a sum of four scalarlike $\mathcal{L}$'s, three with the sign
to give a positive energy and one with the opposite sign. Therefore, there
is no lower or upper bound on the energy and the system is unstable.

How do we produce an $\mathcal{L}$ with positive energy? There are two ways.
The first, which we leave as an exercise (see Problem 7.9), is to write
down all possible invariant bilinear functions of $A_\mu$ and $\partial^\mu A_\nu$ with arbitrary
coefficients, calculate the energy, and adjust the coefficients to make the
energy positive-definite.

A second way is easier² and perhaps more enlightening. Notice in
(7.5.1) that the space components of $A_\mu$ carry positive energy, the time
component negative energy. Furthermore, the space components form a
rotational vector, and the time component a rotational scalar. Perhaps the
scalar is present only to preserve Lorentz invariance, and we should try
to eliminate it as an independent field. The clue is in the four-vector wave
number $k_\mu$,³ which for a massive propagating field will be timelike:

$$k_\alpha k^\alpha = \mu^2. \tag{7.5.2}$$

Therefore, there is a Lorentz system for which $k^i = 0$, and the
Lorentz-invariant condition

$$k_\alpha A^\alpha = 0 \tag{7.5.3}$$

will set $A_0 = 0$ in that coordinate system.
We expect the field to be radiated by a current density $j_\mu$, according
to an equation resembling

$$(\partial_\alpha\partial^\alpha + \mu^2)\,A^\lambda = -j^\lambda \tag{7.5.4}$$

(note rationalized units), but requiring that a propagating solution with
wave number $k^\alpha$ will satisfy $k_\alpha A^\alpha = 0$.
²This is especially true in the case of a second-rank tensor field, like the gravitational
field, where, as we shall see in Chapter 8, the Lagrangian has many possible terms.

³We go back and forth freely between a coordinate space description of the fields and
their Fourier-transform space.

We try to accomplish this by projecting (7.5.4). With wave number
$k^\alpha$, (7.5.4) becomes

$$\left((ik_\alpha)(ik^\alpha) + \mu^2\right)A^\lambda = -P^\lambda{}_\nu\,j^\nu, \tag{7.5.5}$$

where $P$ is a projection operator that makes (7.5.3) hold for $k^2 = \mu^2$. The
projection operator $P^\lambda{}_\nu$ is clearly $\delta^\lambda_\nu - (k^\lambda k_\nu/\mu^2)$, since

$$k_\lambda\left(\delta^\lambda_\nu - \frac{k^\lambda k_\nu}{\mu^2}\right) = k_\nu\left(1 - \frac{k^2}{\mu^2}\right),$$

which vanishes at $k^2 = \mu^2$. Note that this procedure, as opposed to the
direct construction of a Lagrangian density suggested earlier, requires that
one start with $\mu \neq 0$.
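
The contraction $k_\lambda P^\lambda{}_\nu$ can be evaluated numerically for sample wave vectors. The sketch below assumes the $(+,-,-,-)$ metric used in the text; the helper names and the sample components of `K_ON` and `K_OFF` are hypothetical.

```python
import math

ETA = (1.0, -1.0, -1.0, -1.0)  # metric signature (+, -, -, -)

def lower(v_up):
    """Lower the index of a four-vector."""
    return [ETA[i] * v_up[i] for i in range(4)]

def projector(k_up, mu):
    """P^lambda_nu = delta^lambda_nu - k^lambda k_nu / mu^2."""
    k_low = lower(k_up)
    return [[(1.0 if lam == nu else 0.0) - k_up[lam] * k_low[nu] / mu ** 2
             for nu in range(4)] for lam in range(4)]

def contract_with_k(k_up, mu):
    """The covariant vector k_lambda P^lambda_nu."""
    k_low = lower(k_up)
    p = projector(k_up, mu)
    return [sum(k_low[lam] * p[lam][nu] for lam in range(4)) for nu in range(4)]

MU = 2.0
# On the mass shell: k^0 chosen so that k^2 = mu^2.
K_ON = [math.sqrt(MU ** 2 + 1.0 ** 2 + 2.0 ** 2 + 0.5 ** 2), 1.0, 2.0, 0.5]
# Off the mass shell: k^2 = 9 - 1 = 8, so 1 - k^2/mu^2 = -1.
K_OFF = [3.0, 1.0, 0.0, 0.0]

ON_SHELL = contract_with_k(K_ON, MU)     # every component close to zero
OFF_SHELL = contract_with_k(K_OFF, MU)   # equals -k_nu
```

On the mass shell every component vanishes up to rounding, so the projected source cannot excite $k\cdot A$. Off shell the result is $k_\nu(1 - k^2/\mu^2)$, here $-k_\nu$.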

Equation (7.5.5) now becomes 


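$$(\mu^2 - k^2)\,A^\lambda = -j^\lambda + \frac{k^\lambda\,k\cdot j}{\mu^2}. \tag{7.5.6}$$

We eliminate $k\cdot j$ by operating on (7.5.6) with $k_\lambda$:

$$(\mu^2 - k^2)\,k\cdot A = -k\cdot j\left(1 - \frac{k^2}{\mu^2}\right) \quad\text{or}\quad k\cdot j = -\mu^2\,k\cdot A, \tag{7.5.7}$$

and (7.5.6) becomes

$$(\mu^2 - k^2)\,A^\lambda + k^\lambda\,k\cdot A = -j^\lambda. \tag{7.5.8}$$

In coordinate space, $k_\mu = (1/i)\,\partial_\mu$, so

$$\mu^2 A^\lambda + \partial_\alpha\partial^\alpha A^\lambda - \partial^\lambda\partial_\alpha A^\alpha = -j^\lambda$$

or

$$\partial_\alpha\left(\partial^\alpha A^\lambda - \partial^\lambda A^\alpha\right) + \mu^2 A^\lambda = -j^\lambda$$

or

$$\partial_\alpha F^{\alpha\lambda} + \mu^2 A^\lambda = -j^\lambda, \tag{7.5.9}$$

the natural extension of Maxwell's equations to the case of finite $\mu$.
Equation (7.5.9) is called the Proca equation.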


The cancellation of the factor $\mu^2 - k^2$ in (7.5.7) would appear to
contradict the assumption that would make $k\cdot A = 0$ for a propagating
mode. However, the remaining identity, $\mu^2\,k\cdot A = -k\cdot j$ or

$$\partial_\alpha A^\alpha = -\frac{\partial_\alpha j^\alpha}{\mu^2},$$

shows that the cancellation does occur, since it is only at the source $j_\alpha$
that $\partial_\alpha A^\alpha$ fails to be zero. An equivalent statement is that $k\cdot A$ has no
pole at $k^2 = \mu^2$, although $A^\lambda$ does have such a pole. The residue of the
pole represents the propagating radiation. Since $k\cdot A$ has no pole, the
propagating solution has $k\cdot A = 0$.

We see in (7.5.9) that there are only three degrees of freedom for the
vector potential $A^\lambda$, since the second-order time derivatives only act on
the space components $A^i$. Therefore, $A^0$ is a constrained variable, and
there are only three propagating modes. Of course, this results directly
from the way in which we derived the equation, that is, from demanding
that the propagating mode with $k^i = 0$ have no $A^0$ component. The three
degrees of freedom correspond in quantum theory to the three directions
of spin of a massive spin one particle.

The presence of the mass term has destroyed the gauge invariance of
the massless theory. We also note that the limit $\mu^2 \to 0$ requires a conserved
current $j^\lambda$, since in that limit the left-hand side of (7.5.9), $\partial_\alpha F^{\alpha\lambda}$,
is identically conserved by the antisymmetry of $F^{\alpha\lambda}$. Of course, in that
limit gauge invariance is restored, and any one spatial component of the
vector potential can be eliminated by a gauge transformation, leaving two
independent modes.

We next write a free field Lagrangian density that will yield (7.5.9)
when the $A^\lambda$ are taken as the independent coordinates.⁴ Since it is the
spatial vector coordinates that correspond to the actual degrees of freedom
of the field, we choose the sign of $\mathcal{L}$ to make their energy positive. Also,
for $\mu^2 = 0$, $\mathcal{L}$ should be gauge- and Lorentz-invariant, and bilinear in $A^\lambda$.
The only choice is

$$\mathcal{L} = -\frac{c_1}{4}\,F_{\mu\nu}F^{\mu\nu} + \frac{c_2}{2}\,\mu^2 A_\mu A^\mu, \tag{7.5.10}$$

where $c_1$ determines our units and $c_2/c_1 = 1$ satisfies the field equation
(7.5.9).

From now on, we work in rationalized units with the free field
Lagrangian density

$$\mathcal{L} = -\tfrac{1}{4}\,F_{\mu\nu}F^{\mu\nu} + \frac{\mu^2}{2}\,A_\mu A^\mu. \tag{7.5.11}$$


⁴And $j^\lambda$ is set equal to zero.

We already expect, from our work with particle Lagrangians in external
fields, that the scalar potential enters as $-q\phi(\mathbf{y}_p, t)$, where $\mathbf{y}_p$ is the
particle coordinate and $q$ its charge. Since

$$q\phi(\mathbf{y}_p, t) = \int d^3x\,\rho(\mathbf{x})\,\phi(\mathbf{x}, t),$$

where $\rho$ is the particle charge density, we guess that the correct interaction
Lagrangian density should be $\mathcal{L}_I = j^\mu A_\mu$. The Lagrangian density (to
whose action we will eventually add the particle degrees of freedom) is
then

$$\mathcal{L} = -\tfrac{1}{4}\,F_{\mu\nu}F^{\mu\nu} + \frac{\mu^2}{2}\,A_\mu A^\mu + j^\mu A_\mu. \tag{7.5.12}$$

With

$$F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$$

we have

$$\frac{\partial\mathcal{L}}{\partial(\partial_\mu A_\nu)} = -F^{\mu\nu} \tag{7.5.13}$$

and

$$\frac{\partial\mathcal{L}}{\partial A_\nu} = \mu^2 A^\nu + j^\nu, \tag{7.5.14}$$

leading as expected to (7.5.9):

$$-\partial_\mu F^{\mu\nu} = \mu^2 A^\nu + j^\nu. \tag{7.5.15}$$


We may repeat the argument following (7.5.9) from the Lagrangian
point of view, by noting that $\dot A_0$ does not appear in $\mathcal{L}$. Therefore, the
$A_0$ equation

$$\frac{\delta S}{\delta A_0} = 0$$

is a constraint equation, and only the three $A_i$ equations are dynamical.
As before, when we consider the $\mu = 0$ limit, we eliminate one more
degree of freedom with the introduction of the arbitrary gauge function.

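We wish to verify that the energy $P^0$ of this theory, with $j_\mu = 0$, is
positive. It is

$$P^0 = \int d^3x\left(\frac{\partial\mathcal{L}}{\partial\dot A_\nu}\,\dot A_\nu - \mathcal{L}\right), \tag{7.5.16}$$

since the canonical tensor $\Theta^{\mu\nu}$ and the symmetric tensor $T^{\mu\nu}$ give the
same conserved energy-momentum vector $P^\mu$. In terms of $(\phi, \mathbf{A}) =
-(A^0, A^i)$ with

$$\mathcal{L} = \tfrac{1}{2}(\nabla\phi + \dot{\mathbf{A}})^2 - \tfrac{1}{2}(\nabla\times\mathbf{A})^2 + \frac{\mu^2}{2}(\phi^2 - \mathbf{A}^2), \tag{7.5.17}$$

we have

$$P^0 = \int d^3x\left\{\dot{\mathbf{A}}\cdot(\nabla\phi + \dot{\mathbf{A}}) - \tfrac{1}{2}(\nabla\phi + \dot{\mathbf{A}})^2 + \tfrac{1}{2}(\nabla\times\mathbf{A})^2 - \frac{\mu^2}{2}(\phi^2 - \mathbf{A}^2)\right\}$$

$$= \int d^3x\left\{\frac{\dot{\mathbf{A}}^2}{2} - \frac{(\nabla\phi)^2}{2} + \frac{(\nabla\times\mathbf{A})^2}{2} + \frac{\mu^2\mathbf{A}^2}{2} - \frac{\mu^2\phi^2}{2}\right\} \tag{7.5.18}$$

or, with $\boldsymbol{\pi} = \nabla\phi + \dot{\mathbf{A}}$, and using the equation [from (7.5.15), with $j^\mu = 0$]

$$\nabla\cdot\boldsymbol{\pi} = \mu^2\phi,$$

we find, after an integration by parts,

$$P^0 = \int d^3x\left\{\frac{\boldsymbol{\pi}^2}{2} + \frac{(\nabla\times\mathbf{A})^2}{2} + \frac{\mu^2\mathbf{A}^2}{2} + \frac{(\nabla\cdot\boldsymbol{\pi})^2}{2\mu^2}\right\}, \tag{7.5.19}$$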


which is positive-definite. 
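
The integration by parts can be spelled out as follows. This is a sketch of the intermediate steps, assuming that the fields fall off fast enough at spatial infinity for the surface term to vanish, and writing $\boldsymbol{\pi} = \nabla\phi + \dot{\mathbf{A}}$ with the constraint $\nabla\cdot\boldsymbol{\pi} = \mu^2\phi$.

```latex
\begin{aligned}
\int d^3x\,\dot{\mathbf{A}}\cdot\boldsymbol{\pi}
  &= \int d^3x\,(\boldsymbol{\pi} - \nabla\phi)\cdot\boldsymbol{\pi}
   = \int d^3x\,\left(\boldsymbol{\pi}^2 + \phi\,\nabla\cdot\boldsymbol{\pi}\right)
   = \int d^3x\,\left(\boldsymbol{\pi}^2 + \mu^2\phi^2\right), \\
P^0 &= \int d^3x\left\{\dot{\mathbf{A}}\cdot\boldsymbol{\pi} - \tfrac12\boldsymbol{\pi}^2
      + \tfrac12(\nabla\times\mathbf{A})^2 - \tfrac{\mu^2}{2}\phi^2 + \tfrac{\mu^2}{2}\mathbf{A}^2\right\} \\
    &= \int d^3x\left\{\tfrac12\boldsymbol{\pi}^2 + \tfrac12(\nabla\times\mathbf{A})^2
      + \tfrac{\mu^2}{2}\mathbf{A}^2 + \tfrac{\mu^2}{2}\phi^2\right\},
\qquad \tfrac{\mu^2}{2}\phi^2 = \frac{(\nabla\cdot\boldsymbol{\pi})^2}{2\mu^2}.
\end{aligned}
```

Each of the four terms is a square with a positive coefficient, which is the positivity claimed for (7.5.19). The last term also shows why the construction needs $\mu \neq 0$.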


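It is also of interest to express the energy $P^0$ in terms of the Fourier
transform of the vector field:⁵

$$A_\mu(x) = \int d^3k\left[a_\mu(\mathbf{k})\,e^{ik\cdot x} + \text{c.c.}\right].$$

Since $P^0$ is time-independent, only cross terms between $e^{\pm ik\cdot x}$ will be
different from zero. In addition, the $d^3x$ integral will only connect $+\mathbf{k}$ to
$-\mathbf{k}$ and, hence, $a_\mu$ to $a_\mu^*$. From the divergence condition on $A_\mu$, $k_0 a_0 =
\mathbf{k}\cdot\mathbf{a}$, and from (7.5.18),

$$P^0 = (2\pi)^3\int d^3k\left[|\mathbf{a}(\mathbf{k})|^2 - \frac{|\mathbf{k}\cdot\mathbf{a}|^2}{k_0^2}\right]\left(k_0^2 + \mathbf{k}^2 + \mu^2\right)$$

or, with $\mathbf{k}$ in the three direction, and $k/k_0 = \epsilon < 1$,

$$P^0 = (2\pi)^3\int d^3k\left\{(k_0^2 + k^2 + \mu^2)\left(|a_1|^2 + |a_2|^2\right) + (1 - \epsilon^2)(k_0^2 + k^2 + \mu^2)\,|a_3|^2\right\}. \tag{7.5.20}$$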


⁵Note here, and remember in the future, that $e^{\pm ik\cdot x} = e^{\pm i(k^0x^0 - \mathbf{k}\cdot\mathbf{x})}$.

We see that as $\mu \to 0$, $k \to k_0$ and $\epsilon \to 1$, so

$$P^0 \to (2\pi)^3\int d^3k\,2k^2\left[|a_1|^2 + |a_2|^2\right] \tag{7.5.21}$$

and the longitudinal mode $a_3$ carries no energy, unless $a_3$ becomes singular
as $\mu \to 0$. This does not happen in the vector theory we are now considering.
We shall see later that the situation with the gravitational field is not
so simple.
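
The suppression of the longitudinal mode can be made concrete with a short numerical illustration. The sketch below evaluates the factor $1 - \epsilon^2$ that multiplies $|a_3|^2$ in (7.5.20), using $k_0 = \sqrt{k^2 + \mu^2}$; the function name and the sample values are chosen for the example.

```python
import math

def longitudinal_weight(k, mu):
    """Factor (1 - eps^2) multiplying |a_3|^2 in (7.5.20), with eps = k / k0."""
    k0 = math.sqrt(k * k + mu * mu)
    eps = k / k0
    return 1.0 - eps * eps

K = 1.0
WEIGHTS = [longitudinal_weight(K, mu) for mu in (1.0, 0.1, 0.01, 0.001)]
```

The weight equals $\mu^2/(k^2 + \mu^2)$, so it falls off like $\mu^2$ at fixed $k$ and the longitudinal contribution disappears in the massless limit as long as $a_3$ stays finite.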

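From the general result (7.4.37) and the preceding discussion we can
calculate the symmetric stress tensor $T^{\mu\nu}$ for the electromagnetic field and
a charged particle in interaction with each other. The action is

$$S = -\tfrac{1}{4}\int d^4x\,F_{\mu\nu}F^{\mu\nu} - \int d\sigma\left\{\frac{m}{2}\frac{dy^\mu}{d\sigma}\frac{dy_\mu}{d\sigma} - q\,\frac{dy^\mu}{d\sigma}A_\mu\right\}. \tag{7.5.22}$$

The stress tensor is

$$T^{\mu\nu} = \left(F^{\mu\lambda}F_\lambda{}^\nu + \tfrac{1}{4}\,\eta^{\mu\nu}F_{\alpha\beta}F^{\alpha\beta}\right) + m\int d\sigma\,\frac{dy^\mu}{d\sigma}\frac{dy^\nu}{d\sigma}\,\delta^4(x - y(\sigma)), \tag{7.5.23}$$

in agreement with (6.6.19). (See Problem 7.5.)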
Evidently, we have chosen the signs of the free field and free particle
terms in the action correctly, since the energy density

$$T^{00} = \tfrac{1}{2}(\mathbf{E}^2 + \mathbf{B}^2) + \frac{m\,\delta^3(\mathbf{x} - \mathbf{y}(\sigma))}{\left[1 - \left(\dfrac{d\mathbf{y}}{dy^0}\right)^2\right]^{1/2}} \tag{7.5.24}$$

is positive.

As we have remarked earlier, it is an exceptional circumstance that
the electromagnetic and particle stress tensor $T^{\mu\nu}$ is the sum of the free
particle and free field stress tensors, even though the equations for $\mathbf{E}$, $\mathbf{B}$,
and $y$ involve the interaction between them. It is clear from the discussion
leading to (7.4.37) that $T^{\mu\nu}$ will normally have an explicit interaction term
in other field theories.

One must realize that the fact that the energy density (7.5.24) is
positive (strictly, nonnegative) is a necessary, but not sufficient condition
for the equations of the theory to have sensible (i.e., finite) solutions. In
fact, the equations of the electrodynamics of point particles do not appear
to have sensible solutions. In classical theory, this is manifested by the
electromagnetic contribution to the inertia of the point particle becoming
infinite (a consequence of the singularity of the electric and magnetic fields
at the position of the particle). This problem was discovered by Lorentz,
who tried to remove it by renormalization. We explain.

Lorentz calculated the self-force (i.e., the force of the retarded fields 
generated by the electron on itself) for an electron, instantaneously at 
rest, but undergoing arbitrary motion as a function of time. To make the 
calculation finite, he assumed a charge density for the electron 


$$\rho(\mathbf{x}, t) = e\,f(\mathbf{x} - \mathbf{y}(t)), \tag{7.5.25}$$

where

$$\int f(\mathbf{x})\,d^3x = 1. \tag{7.5.26}$$


Lorentz found for the force 


dy 2edy 
F= -ôm — +74 ; 7.5.27 
d? 3e d? ( ) 
where $\delta m$ is the electromagnetic self-mass
pata Í dx d’y FOOLY) (7.5.28) 
3 |x—y| 


and y(t) approaches y(t)) as the characteristic radius R of the cut off 
function f approaches zero. In the limit of a point electron, the integral 
(7.5.28) diverges like 1/R. The second term in (7.5.27) (which we have 
already seen in Section 5.9 and Problem 4.8) is independent of R as R > 0. 
In this limit, there are no other terms. 

Lorentz observed that the equation of motion for the electron would 
follow from (7.5.27): 


sie ep oe (7.5.29) 


where 


$$m = m_0 + \delta m. \tag{7.5.31}$$

Since the equation of motion for the electron only involves $m$, and
not $m_0$ and $\delta m$ separately, one might optimistically hope that a finite $m$,
achieved either by a negatively infinite $m_0$, or by a cut-off at small $r$,
would leave us with a sensible theory.

These two alternatives must be considered separately. The first, a
negatively infinite $m_0$, does not work, because (7.5.30) has exponentially
growing solutions. We have already seen a hint of this problem (without
the infinite $\delta m$) in Section 5.9, with (5.9.11) and (5.9.12).

One can, in fact, show that the run-away solutions and the divergent 
self-mass are related problems: A charge density that makes the integral 
(7.5.28) finite does not have run-away solutions. Unfortunately, such a 
cut-off function would violate special relativity, since the action, as in 
Problem 7.8, would not be Lorentz-invariant. The application of relativis- 
tic quantum theory improves the situation but not enough: It turns the 
linear divergence of (7.5.28), that is, the linear dependence on 1/R, into 
a logarithmic divergence. The run-away solutions disappear, but new prob- 
lems arise. 

How should one deal with this situation? There is probably a modest 
consensus favoring the following view. 

We know that the electron mass is finite; therefore, our theory is
wrong. At some small distance, $R_0$, the equations must become less
singular so that the equivalent integral (7.5.28) converges. One can try
to guess a value for $R_0$. A popular guess is the Planck radius $R_p$. $R_p$ is
the Compton wavelength and radius of a body whose gravitational
self-energy is equal to its rest energy.^6 This radius is

$$R_p = \left(\frac{G\hbar}{c^3}\right)^{1/2} \approx 10^{-33}\ \text{cm}.$$ (7.5.32)

Here, $G$ is Newton's gravitational constant and $\hbar$ Planck's constant divided
by $2\pi$.

^6 Incidentally, a particle with this radius would be a black hole.

Why pick $R_p$? For one thing, the quantum theory of gravity must
come into play at that radius. We have no satisfactory quantum theory
of gravitation; perhaps a correct quantum theory of gravity would provide
the necessary cut-off. Since the divergence is only logarithmic, the
self-mass is relatively insensitive to the cut-off. The self-mass value given by
quantum electrodynamics, cut-off at the Planck radius, is

$$\frac{\delta m}{m} = \frac{3\alpha}{2\pi}\, \log\left(\frac{\hbar/m_e c}{R_p}\right)$$ (7.5.33)

where $\alpha$ is the fine structure constant:

$$\alpha = \frac{e^2}{\hbar c} \approx \frac{1}{137}$$ (7.5.34)

and $\hbar/m_e c$ is the Compton wavelength of the electron:

$$\frac{\hbar}{m_e c} \approx 4 \times 10^{-11}\ \text{cm}.$$ (7.5.35)

The resulting logarithm, $\log 10^{22}$, is small enough so that $\delta m/m$ is still
smaller than 1.
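A rough numerical check of these orders of magnitude can be sketched as follows. The CGS constants below are rounded values supplied here, not taken from the text; the logarithm is assumed to be natural; and the function names are hypothetical.

```python
import math

# Rounded CGS (Gaussian) constants; assumptions supplied for this check only.
G = 6.674e-8        # cm^3 g^-1 s^-2
HBAR = 1.0546e-27   # erg s
C = 2.998e10        # cm / s
M_E = 9.109e-28     # g
ALPHA = 1 / 137.036


def planck_radius():
    """R_p = (G hbar / c^3)^(1/2), in cm."""
    return math.sqrt(G * HBAR / C**3)


def compton_wavelength():
    """hbar / (m_e c), in cm."""
    return HBAR / (M_E * C)


def self_mass_ratio(cutoff_cm):
    """delta m / m = (3 alpha / 2 pi) * ln(Compton wavelength / cutoff)."""
    return 3 * ALPHA / (2 * math.pi) * math.log(compton_wavelength() / cutoff_cm)


if __name__ == "__main__":
    print("Planck radius (cm):      ", planck_radius())
    print("Compton wavelength (cm): ", compton_wavelength())
    print("dm/m, cut-off at R_p:    ", self_mass_ratio(planck_radius()))
    print("dm/m, cut-off at 1e-16:  ", self_mass_ratio(1e-16))
```

Because the dependence on the cut-off is logarithmic, moving the cut-off across many orders of magnitude changes the ratio only by a modest factor, and it stays below 1 throughout.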

Of course the cut-off, the distance where our present theories fail,
could be much greater than $10^{-33}$ cm. We know experimentally from
electron-positron collisions that electrodynamics holds at least as far down
as $10^{-16}$ cm, so the cut-off could be anywhere from $10^{-16}$ to $10^{-33}$ cm!

To conclude this brief discussion of the boundaries to our understanding,
we summarize.

Our present theory of electromagnetic fields interacting with electrons
does not lead to finite results. However, modifications of the theory at
interaction distances that might be as small as $10^{-33}$ cm might provide a
consistent, finite theory, with no perceptible effect on present-day physics,
including atomic and nuclear scale phenomena. We proceed with this
assumption, even though we do not know how to construct such a theory.


7.6. GENERAL COVARIANCE 


We next take up the subject of general covariance, that is, the study of 
objects that transform like tensors under general coordinate transforma- 
tions. We need this knowledge in order to formulate a consistent theory 
of gravitational fields (Sections 8.6 and 8.7). We consider it now because 
it permits us to construct a symmetric stress tensor $T^{\mu\nu}$ directly from a
generally covariant action (Section 7.8). In addition, it permits us to write 
known dynamical equations in arbitrary coordinate systems. 

We start from the notion of physical tensors under Lorentz transfor- 
mations and define an extension to general coordinate transformations. 
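Whatever the coordinate transformation $x'(x)$ may be, define a contravariant
vector to transform like $dx^\mu$, that is, like

$$V'^\mu = \frac{\partial x'^\mu}{\partial x^\nu}\, V^\nu.$$ (7.6.1)

Higher-rank contravariant tensors transform like products of vectors, so
that

$$T'^{\mu\nu} = \frac{\partial x'^\mu}{\partial x^\lambda}\, \frac{\partial x'^\nu}{\partial x^\sigma}\, T^{\lambda\sigma},$$ (7.6.2)

etc.

Tensor equations hold in all coordinate systems. Thus, if $V^\nu = 0$, so
will

$$V'^\mu = \frac{\partial x'^\mu}{\partial x^\nu}\, V^\nu$$ (7.6.3)

and conversely. To see the converse, multiply (7.6.3) by $\partial x^\lambda/\partial x'^\mu$. There
results

$$\frac{\partial x^\lambda}{\partial x'^\mu}\, V'^\mu = \frac{\partial x^\lambda}{\partial x'^\mu}\, \frac{\partial x'^\mu}{\partial x^\nu}\, V^\nu = V^\lambda.$$ (7.6.4)

The tensor transformation property defined by (7.6.1) is consistent.
That is, if

$$V'^\mu = \frac{\partial x'^\mu}{\partial x^\nu}\, V^\nu$$

and

$$V''^\lambda = \frac{\partial x''^\lambda}{\partial x'^\mu}\, V'^\mu,$$

then

$$V''^\lambda = \frac{\partial x''^\lambda}{\partial x'^\mu}\, \frac{\partial x'^\mu}{\partial x^\nu}\, V^\nu$$

or, by the chain rule,

$$V''^\lambda = \frac{\partial x''^\lambda}{\partial x^\nu}\, V^\nu.$$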


There are physical invariants under Lorentz transformations. We
define them to be invariants under general coordinate transformations.
Thus, if $\phi$ is an invariant field,

$$\phi'(x'(x)) = \phi(x) \quad\text{and}\quad \delta\phi = \phi(x + \delta x) - \phi(x) = \frac{\partial\phi}{\partial x^\mu}\, \delta x^\mu$$ (7.6.5)

is also an invariant. Evidently,

$$\frac{\partial\phi'}{\partial x'^\mu} = \frac{\partial\phi}{\partial x^\nu}\, \frac{\partial x^\nu}{\partial x'^\mu};$$ (7.6.6)


this transformation rule is called covariant. Just as in the case of Lorentz 
tensors, the contraction of a covariant with a contravariant index produces 
an invariant. We see a special case in (7.6.5); the general rule follows 
from the defined transformation properties. 
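The invariance of such a contraction can be checked numerically for the change from Cartesian to plane polar coordinates. This is a minimal sketch under that assumed example; the helper names are hypothetical, and the component values are arbitrary.

```python
import math


def jacobian_polar_from_cartesian(x, y):
    """Matrix of d(rho, phi)/d(x, y) evaluated at the point (x, y)."""
    rho2 = x * x + y * y
    rho = math.sqrt(rho2)
    return [[x / rho, y / rho],
            [-y / rho2, x / rho2]]


def jacobian_cartesian_from_polar(rho, phi):
    """Matrix of d(x, y)/d(rho, phi) evaluated at the point (rho, phi)."""
    return [[math.cos(phi), -rho * math.sin(phi)],
            [math.sin(phi), rho * math.cos(phi)]]


def transform_contravariant(jac_new_old, v):
    """V'^mu = (dx'^mu / dx^nu) V^nu, as in (7.6.1)."""
    return [sum(jac_new_old[m][n] * v[n] for n in range(2)) for m in range(2)]


def transform_covariant(jac_old_new, w):
    """W'_mu = (dx^nu / dx'^mu) W_nu, as in (7.6.6)."""
    return [sum(jac_old_new[n][m] * w[n] for n in range(2)) for m in range(2)]


def contract(v, w):
    """V^mu W_mu."""
    return sum(a * b for a, b in zip(v, w))


if __name__ == "__main__":
    x, y = 1.2, 0.7
    rho, phi = math.hypot(x, y), math.atan2(y, x)
    v, w = [0.3, -1.1], [2.0, 0.5]
    v_new = transform_contravariant(jacobian_polar_from_cartesian(x, y), v)
    w_new = transform_covariant(jacobian_cartesian_from_polar(rho, phi), w)
    print(contract(v, w), contract(v_new, w_new))
```

The two printed numbers agree to rounding error because the two Jacobian matrices are inverses of each other at the same point.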

Contravariant, covariant, and mixed tensors can be found by multipli- 
cation. As usual, symmetry properties are preserved under tensor transfor- 
mations. 

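An interesting object is $\delta^\mu_\nu$, which is a mixed tensor since

$$\delta'^\mu_\nu = \frac{\partial x'^\mu}{\partial x^\lambda}\, \frac{\partial x^\sigma}{\partial x'^\nu}\, \delta^\lambda_\sigma = \delta^\mu_\nu.$$ (7.6.7)

The space-time interval

$$d\tau^2 = d\xi^\mu\, \eta_{\mu\nu}\, d\xi^\nu,$$ (7.6.8)

where the $\xi^\mu$'s are the normal rectangular coordinates in some Lorentz
system, is an invariant. In a general coordinate system, it will have the
form

$$d\tau^2 = \eta_{\mu\nu}\, \frac{\partial\xi^\mu}{\partial x^\lambda}\, \frac{\partial\xi^\nu}{\partial x^\sigma}\, dx^\lambda\, dx^\sigma = g_{\lambda\sigma}\, dx^\lambda\, dx^\sigma.$$ (7.6.9)

Since $d\tau^2$ is an invariant, $g_{\lambda\sigma}$ is a tensor and symmetric. It is called the
metric tensor.^7 The tensor $g_{\mu\nu}$ can be used to lower indices. Thus, if $V^\nu$
is a contravariant vector, $V_\mu = g_{\mu\nu} V^\nu$ is its covariant representation. This
can be seen as follows. Let $U^\mu$ and $U_\mu = \eta_{\mu\nu} U^\nu$ be the contravariant and
covariant representations of a vector in a Minkowskian coordinate system.
Remember: If $U^\mu = (U^0, \mathbf{U})$, $U_\mu = (U^0, -\mathbf{U})$. Then, by the definition of
contravariance and covariance,

$$V^\mu = \frac{\partial x^\mu}{\partial\xi^\lambda}\, U^\lambda$$ (7.6.10)

and

$$V_\mu = \frac{\partial\xi^\lambda}{\partial x^\mu}\, U_\lambda = \frac{\partial\xi^\lambda}{\partial x^\mu}\, \eta_{\lambda\sigma}\, U^\sigma.$$ (7.6.11)

Since

$$U^\sigma = \frac{\partial\xi^\sigma}{\partial x^\nu}\, V^\nu,$$

Equation (7.6.11) yields

$$V_\mu = \frac{\partial\xi^\lambda}{\partial x^\mu}\, \eta_{\lambda\sigma}\, \frac{\partial\xi^\sigma}{\partial x^\nu}\, V^\nu = g_{\mu\nu}\, V^\nu.$$

^7 Conversely, if $g_{\mu\nu}$ is a tensor, $d\tau^2$ as defined in (7.6.9) is an invariant.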

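The metric tensor has its contravariant counterpart, defined here by

$$g^{\mu\lambda}\, g_{\lambda\nu} = \delta^\mu_\nu;$$ (7.6.12)

from (7.6.12) we see that lowering both indices of $g^{\mu\nu}$ produces $g_{\mu\nu}$, since

$$g_{\mu\sigma}\, g^{\sigma\lambda}\, g_{\lambda\nu} = g_{\mu\sigma}\, \delta^\sigma_\nu = g_{\mu\nu}.$$ (7.6.13)

The determinant of $g_{\mu\nu}$ is another interesting object. Of course, the
determinant of $\eta_{\mu\nu}$ is $-1$ and remains unchanged under Lorentz
transformations. The same is not true of $\det g_{\mu\nu}$. We define

$$g = -\det g_{\mu\nu}$$ (7.6.14)

and calculate $g'$:

$$g' = -\det g'_{\mu\nu} = -\det\left(\frac{\partial x^\lambda}{\partial x'^\mu}\, g_{\lambda\sigma}\, \frac{\partial x^\sigma}{\partial x'^\nu}\right) = \left[\det\left(\frac{\partial x}{\partial x'}\right)\right]^2 g$$

and

$$\sqrt{g'} = \det\left(\frac{\partial x}{\partial x'}\right) \sqrt{g}.$$ (7.6.15)

An object that transforms like $\sqrt{g}$ is called a scalar density of weight
$-1$. The number of powers of $\det(\partial x'/\partial x)$ that multiply a normal tensor
transformation law is called the weight of the tensor density. Thus, for
example, $(1/\sqrt{g})\, T^{\mu\nu}$ is a second-rank tensor density of weight 1.

The transformation property of $\sqrt{g}$ provides us with an invariant
volume element. Since

$$\sqrt{g}\, d^4x = \sqrt{g}\, \det\left(\frac{\partial x}{\partial x'}\right) d^4x' = \sqrt{g'}\, d^4x',$$ (7.6.16)

$\sqrt{g}\, d^4x$ is an invariant volume element.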
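As a worked instance of (7.6.16), consider the flat plane in polar coordinates. This is a sketch for a Euclidean metric, so $g$ is taken here as $+\det g_{\mu\nu}$ rather than with the minus sign of (7.6.14); the coordinate names are chosen for the illustration.

```latex
x = \rho\cos\varphi, \qquad y = \rho\sin\varphi, \qquad
(ds)^2 = (d\rho)^2 + \rho^2 (d\varphi)^2
\;\Longrightarrow\;
g_{\rho\rho} = 1, \quad g_{\varphi\varphi} = \rho^2, \quad \sqrt{g} = \rho,
\\[4pt]
\det\frac{\partial(x, y)}{\partial(\rho, \varphi)}
= \begin{vmatrix} \cos\varphi & -\rho\sin\varphi \\ \sin\varphi & \rho\cos\varphi \end{vmatrix}
= \rho,
\qquad
dx\, dy = \rho\, d\rho\, d\varphi = \sqrt{g}\, d\rho\, d\varphi .
```

The Jacobian determinant that relates the two area elements is exactly the factor supplied by $\sqrt{g}$, which is the content of the invariance statement.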

We have learned how to rewrite some Lorentz covariant formulas so
that they are generally covariant. For example, suppose $\phi$ is a scalar field
and $C_\mu$ a covariant vector. Then

$$\frac{\partial\phi}{\partial x^\mu} = C_\mu$$ (7.6.17)

is a generally covariant equation, as is

$$A_\mu\, g^{\mu\nu}\, B_\nu = \text{constant}$$ (7.6.18)

if $A_\mu$ and $B_\nu$ are covariant vectors. The equations of electrodynamics,
however, involve space-time derivatives of vector and tensor fields. In a
Minkowskian coordinate system, these form Lorentz covariant tensors of
one higher rank; they are not tensors under general coordinate
transformations. To take a specific example, we recall that the space-time derivatives
of a contravariant vector $U^\mu$ in a Minkowskian coordinate system form
the components of a mixed Lorentz tensor:

$$\frac{\partial U^\mu}{\partial\xi^\lambda} = Q^\mu{}_\lambda.$$ (7.6.19)


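We can therefore define a tensor $T^\mu{}_\lambda$ in a new coordinate system by
its transformation from $Q^\mu{}_\lambda$:

$$T^\mu{}_\lambda = \frac{\partial x^\mu}{\partial\xi^\sigma}\, \frac{\partial\xi^\tau}{\partial x^\lambda}\, Q^\sigma{}_\tau = \frac{\partial x^\mu}{\partial\xi^\sigma}\, \frac{\partial\xi^\tau}{\partial x^\lambda}\, \frac{\partial U^\sigma}{\partial\xi^\tau}.$$ (7.6.20)

On the other hand, we have already agreed to define the vector $V^\mu$
in a general coordinate system by the equation

$$V^\mu = \frac{\partial x^\mu}{\partial\xi^\sigma}\, U^\sigma.$$ (7.6.21)

If we now calculate $\partial V^\mu/\partial x^\lambda$, we find

$$\frac{\partial V^\mu}{\partial x^\lambda} = \frac{\partial x^\mu}{\partial\xi^\sigma}\, \frac{\partial\xi^\tau}{\partial x^\lambda}\, \frac{\partial U^\sigma}{\partial\xi^\tau} + \frac{\partial}{\partial x^\lambda}\left(\frac{\partial x^\mu}{\partial\xi^\sigma}\right) U^\sigma$$ (7.6.22)

or, since

$$U^\sigma = \frac{\partial\xi^\sigma}{\partial x^\nu}\, V^\nu,$$ (7.6.23)

$$\frac{\partial V^\mu}{\partial x^\lambda} = \frac{\partial x^\mu}{\partial\xi^\sigma}\, \frac{\partial\xi^\tau}{\partial x^\lambda}\, \frac{\partial U^\sigma}{\partial\xi^\tau} + \frac{\partial}{\partial x^\lambda}\left(\frac{\partial x^\mu}{\partial\xi^\sigma}\right) \frac{\partial\xi^\sigma}{\partial x^\nu}\, V^\nu$$ (7.6.24)

$$= T^\mu{}_\lambda - \Gamma^\mu_{\lambda\nu}\, V^\nu$$ (7.6.25)

or

$$T^\mu{}_\lambda = \frac{\partial V^\mu}{\partial x^\lambda} + \Gamma^\mu_{\lambda\nu}\, V^\nu,$$ (7.6.26)

where

$$\Gamma^\mu_{\lambda\nu} = -\frac{\partial\xi^\sigma}{\partial x^\nu}\, \frac{\partial}{\partial x^\lambda}\left(\frac{\partial x^\mu}{\partial\xi^\sigma}\right) = -\frac{\partial\xi^\sigma}{\partial x^\nu}\, \frac{\partial\xi^\tau}{\partial x^\lambda}\, \frac{\partial^2 x^\mu}{\partial\xi^\sigma\, \partial\xi^\tau}.$$ (7.6.27)

$T^\mu{}_\lambda$ is called the covariant derivative of $V^\mu$. It is the tensor that in a
Minkowskian coordinate system is the tensor $\partial U^\mu/\partial\xi^\lambda$.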

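The extra term in (7.6.26) arises from the correct formulation of
parallel displacement. If we displace the vector $U^\mu$ from $\xi$ to $\xi + d\xi$, the
new vector is still $U^\mu$. However, in a general coordinate system, the first
vector is

$$V^\mu = \frac{\partial x^\mu}{\partial\xi^\sigma}\, U^\sigma$$ (7.6.28)

and the displaced vector is

$$V^\mu + \delta V^\mu = \frac{\partial x^\mu}{\partial\xi^\sigma}(x + \delta x)\, U^\sigma$$

$$= V^\mu + \frac{\partial}{\partial x^\lambda}\left(\frac{\partial x^\mu}{\partial\xi^\sigma}\right) \delta x^\lambda\, U^\sigma$$

$$= V^\mu + \frac{\partial}{\partial x^\lambda}\left(\frac{\partial x^\mu}{\partial\xi^\sigma}\right) \delta x^\lambda\, \frac{\partial\xi^\sigma}{\partial x^\nu}\, V^\nu$$ (7.6.29)

or

$$\delta V^\mu = -\Gamma^\mu_{\lambda\nu}\, \delta x^\lambda\, V^\nu.$$ (7.6.30)

The covariant derivative subtracts the parallel displaced vector from the
vector at the new point; this results in a covariant derivative, as given by
(7.6.26).

The three-index quantity $\Gamma^\mu_{\lambda\nu}$ is called the affine connection. Although
$\Gamma^\mu_{\lambda\nu}$ has tensor indices, it is not a tensor. For example, it vanishes in a
Minkowskian coordinate system ($x = \xi$). We note here that $\Gamma^\mu_{\lambda\nu}$ is
symmetric in $\lambda$ and $\nu$.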

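We introduce some convenient notation: The ordinary derivative of
$V^\mu$ is written as

$$\frac{\partial V^\mu}{\partial x^\lambda} = V^\mu{}_{,\lambda}.$$ (7.6.31)

The covariant derivative is written as

$$T^\mu{}_\lambda = V^\mu{}_{;\lambda}.$$ (7.6.32)

The rule for covariantly differentiating a higher-rank tensor follows
trivially from the above procedure: There is one extra "kinematic"
derivative for every $\partial x/\partial\xi$ in the transformation replacing (7.6.21). Therefore,
one must replace the single extra term in (7.6.26) with a sum:

$$V^{\mu\nu}{}_{;\lambda} = \frac{\partial V^{\mu\nu}}{\partial x^\lambda} + \Gamma^\mu_{\lambda\sigma}\, V^{\sigma\nu} + \Gamma^\nu_{\lambda\sigma}\, V^{\mu\sigma},$$ (7.6.33)

etc. for higher tensors.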
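Finally, we derive the equivalent rule for a covariant index by noting
that if $A_\mu$ and $B^\mu$ are vectors, then $A_\mu B^\mu$ is a scalar, $\partial_\lambda(A_\mu B^\mu)$ is a
vector, and

$$\partial_\lambda(A_\mu B^\mu) = (\partial_\lambda A_\mu)\, B^\mu + A_\mu\, (\partial_\lambda B^\mu).$$ (7.6.34)

On the other hand, we can consider (7.6.34) in a Minkowskian coordinate
system and then transform each term separately. The second will become

$$A_\mu\, B^\mu{}_{;\lambda}$$ (7.6.35)

and the first

$$A_{\mu;\lambda}\, B^\mu.$$ (7.6.36)

Therefore,

$$A_\mu\, (B^\mu{}_{;\lambda} - B^\mu{}_{,\lambda}) + B^\mu\, (A_{\mu;\lambda} - A_{\mu,\lambda}) = 0$$

or

$$A_\mu\, \Gamma^\mu_{\lambda\sigma}\, B^\sigma + B^\sigma\, (A_{\sigma;\lambda} - A_{\sigma,\lambda}) = 0$$

so that, since $A$ and $B$ are independent,

$$A_{\sigma;\lambda} = A_{\sigma,\lambda} - \Gamma^\mu_{\lambda\sigma}\, A_\mu.$$ (7.6.37)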
We can express the $\Gamma$'s in terms of the metric tensor by noting that
the covariant derivative of $g_{\mu\nu}$ must be zero, since $g_{\mu\nu}$ is a tensor, and
$g_{\mu\nu} = \eta_{\mu\nu}$, a constant, in a Minkowskian system. Therefore,

$$g_{\mu\nu;\lambda} = \frac{\partial g_{\mu\nu}}{\partial x^\lambda} - \Gamma^\sigma_{\lambda\mu}\, g_{\sigma\nu} - \Gamma^\sigma_{\lambda\nu}\, g_{\mu\sigma} = 0.$$ (7.6.38)

We can solve (7.6.38) for $\Gamma$. Interchange $\nu$ and $\lambda$ in (7.6.38) and
subtract. Then interchange $\mu$ and $\lambda$ in (7.6.38) and subtract this from the
first subtraction. There results

$$g_{\mu\nu,\lambda} - g_{\mu\lambda,\nu} - g_{\lambda\nu,\mu} + 2\Gamma^\sigma_{\mu\nu}\, g_{\sigma\lambda} = 0$$ (7.6.39)

or

$$\Gamma^\sigma_{\mu\nu} = \frac{g^{\sigma\lambda}}{2}\, \left(g_{\mu\lambda,\nu} + g_{\lambda\nu,\mu} - g_{\mu\nu,\lambda}\right).$$ (7.6.40)

Thus, given the components of $g$, $\Gamma$ can be calculated. We will see in
the next section that we can find a coordinate transformation that takes
any metric tensor to Minkowskian form at a point, with vanishing
derivatives, and thus vanishing $\Gamma$. This a posteriori will justify the procedure of
transforming from a Minkowskian system to derive the formulas for
covariant differentiation, whether or not the underlying space is
Minkowskian.^8

It is possible, but nontrivial, to derive (7.6.40) directly from (7.6.9).

We now know how to make any Lorentz covariant expression
generally covariant. We simply replace all derivatives by covariant
derivatives, and all $d^4x$ integrals by $d^4x\, \sqrt{g}$ integrals.
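Formula (7.6.40) can be evaluated numerically for any metric given as a function of the coordinates. The following is a minimal sketch, assuming the spherical metric of three-dimensional Euclidean space, $(ds)^2 = (dr)^2 + r^2 (d\theta)^2 + r^2 \sin^2\theta\, (d\varphi)^2$, as the example and central differences for the derivatives; all function names are hypothetical.

```python
import math


def spherical_metric(q):
    """g_{mu nu} for (ds)^2 = dr^2 + r^2 dtheta^2 + r^2 sin^2(theta) dphi^2 at q = (r, theta, phi)."""
    r, theta, _ = q
    return [[1.0, 0.0, 0.0],
            [0.0, r * r, 0.0],
            [0.0, 0.0, (r * math.sin(theta)) ** 2]]


def invert(m):
    """Gauss-Jordan inverse of a small square matrix, with partial pivoting."""
    n = len(m)
    a = [row[:] + [1.0 if i == j else 0.0 for j in range(n)] for i, row in enumerate(m)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda k: abs(a[k][col]))
        a[col], a[pivot] = a[pivot], a[col]
        p = a[col][col]
        a[col] = [v / p for v in a[col]]
        for k in range(n):
            if k != col:
                f = a[k][col]
                a[k] = [vk - f * vc for vk, vc in zip(a[k], a[col])]
    return [row[n:] for row in a]


def metric_derivative(metric, q, lam, h=1e-5):
    """Central-difference estimate of d g_{ij} / d q^lam."""
    qp, qm = list(q), list(q)
    qp[lam] += h
    qm[lam] -= h
    gp, gm = metric(qp), metric(qm)
    n = len(q)
    return [[(gp[i][j] - gm[i][j]) / (2 * h) for j in range(n)] for i in range(n)]


def christoffel(metric, q):
    """Gamma[s][m][n] = (1/2) g^{s l} (g_{m l, n} + g_{l n, m} - g_{m n, l}), i.e. (7.6.40)."""
    n = len(q)
    ginv = invert(metric(q))
    dg = [metric_derivative(metric, q, lam) for lam in range(n)]  # dg[lam][i][j] = g_{ij,lam}
    return [[[0.5 * sum(ginv[s][l] * (dg[nu][m][l] + dg[m][l][nu] - dg[l][m][nu])
                        for l in range(n))
              for nu in range(n)]
             for m in range(n)]
            for s in range(n)]


if __name__ == "__main__":
    gamma = christoffel(spherical_metric, (2.0, 0.7, 0.3))
    print("Gamma^r_{theta theta} =", gamma[0][1][1])   # analytic value: -r
    print("Gamma^theta_{r theta} =", gamma[1][0][1])   # analytic value: 1/r
```

The symmetry of the result in its two lower indices follows from the symmetry of the bracket in (7.6.40), and can be read off from the output.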

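^8 That is, whether or not there is a coordinate system in which $g_{\mu\nu} = \eta_{\mu\nu}$ everywhere.

There are some special cases worth noting:

1. $$A_{\mu;\nu} - A_{\nu;\mu} = (A_{\mu,\nu} - \Gamma^\lambda_{\nu\mu} A_\lambda) - (A_{\nu,\mu} - \Gamma^\lambda_{\mu\nu} A_\lambda) = A_{\mu,\nu} - A_{\nu,\mu}.$$ (7.6.41)

   This is a very helpful formula in electrodynamics.

2. $F_{\mu\nu;\lambda} + F_{\nu\lambda;\mu} + F_{\lambda\mu;\nu}$ (for $F_{\mu\nu}$ antisymmetric)

   $$= F_{\mu\nu,\lambda} - \Gamma^\sigma_{\lambda\mu} F_{\sigma\nu} - \Gamma^\sigma_{\lambda\nu} F_{\mu\sigma}$$
   $$+ F_{\nu\lambda,\mu} - \Gamma^\sigma_{\mu\nu} F_{\sigma\lambda} - \Gamma^\sigma_{\mu\lambda} F_{\nu\sigma}$$
   $$+ F_{\lambda\mu,\nu} - \Gamma^\sigma_{\nu\lambda} F_{\sigma\mu} - \Gamma^\sigma_{\nu\mu} F_{\lambda\sigma};$$ (7.6.42)

   by virtue of the antisymmetry of $F$ and the $\mu\nu$ symmetry of $\Gamma^\lambda_{\mu\nu}$,
   all the $\Gamma$ terms cancel, leaving

   $$F_{\mu\nu;\lambda} + F_{\nu\lambda;\mu} + F_{\lambda\mu;\nu} = F_{\mu\nu,\lambda} + F_{\nu\lambda,\mu} + F_{\lambda\mu,\nu}.$$ (7.6.43)

   This is also a very helpful formula in electrodynamics.

3. The divergence of a vector (or tensor) $V^\mu{}_{;\mu}$. From (7.6.26),

   $$V^\mu{}_{;\lambda} = \partial_\lambda V^\mu + \Gamma^\mu_{\lambda\nu} V^\nu \quad\text{and}\quad V^\mu{}_{;\mu} = \partial_\mu V^\mu + \Gamma^\mu_{\mu\nu} V^\nu.$$ (7.6.44)

   From (7.6.40),

   $$\Gamma^\mu_{\mu\nu} = \frac{g^{\mu\lambda}}{2}\, g_{\mu\lambda,\nu},$$ (7.6.45)

   which can in turn be calculated from the determinant $g = -\det(g_{\mu\nu})$:

   $$g + \delta g = -\det(g_{\mu\nu} + \delta g_{\mu\nu})$$ (7.6.46)

   and

   $$\frac{1}{g}\,(g + \delta g) = \det(g^{\mu\lambda})\, \det(g_{\lambda\nu} + \delta g_{\lambda\nu})$$ (7.6.47)

   $$= \det[g^{\mu\lambda}(g_{\lambda\nu} + \delta g_{\lambda\nu})] = \det(\delta^\mu_\nu + g^{\mu\lambda}\, \delta g_{\lambda\nu})$$ (7.6.48)

   so

   $$\frac{\delta g}{g} = g^{\mu\nu}\, \delta g_{\mu\nu}, \quad\text{and}\quad \Gamma^\mu_{\mu\nu} = \frac{1}{2g}\, \frac{\partial g}{\partial x^\nu}.$$ (7.6.49)

   For $V^\mu{}_{;\mu}$ we then have

   $$V^\mu{}_{;\mu} = \partial_\mu V^\mu + \frac{1}{2g}\, \frac{\partial g}{\partial x^\mu}\, V^\mu = \frac{1}{\sqrt{g}}\, \partial_\mu(\sqrt{g}\, V^\mu).$$ (7.6.50)

   The divergence of a tensor follows in a similar way:

   $$F^{\mu\nu}{}_{;\mu} = \frac{\partial F^{\mu\nu}}{\partial x^\mu} + \Gamma^\mu_{\mu\sigma} F^{\sigma\nu} + \Gamma^\nu_{\mu\sigma} F^{\mu\sigma};$$ (7.6.51)

   for the special case $F^{\mu\nu} = -F^{\nu\mu}$, the last term in (7.6.51) is zero,
   and we have, for the electromagnetic field tensor,

   $$F^{\mu\nu}{}_{;\mu} = \frac{1}{\sqrt{g}}\, \partial_\mu(\sqrt{g}\, F^{\mu\nu}).$$ (7.6.52)

4. We can use (7.6.50) to derive the general expression for the
   $n$-dimensional Laplacian (or pseudo-Laplacian, if the space is
   Minkowskian rather than Euclidean):

   $$\nabla^2\psi = (g^{\mu\nu}\, \psi_{,\nu})_{;\mu} = \frac{1}{\sqrt{g}}\, \partial_\mu(\sqrt{g}\, g^{\mu\nu}\, \partial_\nu\psi),$$ (7.6.53)

   where $\psi$ is a scalar.

We can illustrate (7.6.53) with the familiar case of $\nabla^2\psi$ in orthogonal
coordinates in three dimensions: With

$$(ds)^2 = h_1^2\, dq_1^2 + h_2^2\, dq_2^2 + h_3^2\, dq_3^2 = g_{\mu\nu}\, dq^\mu\, dq^\nu \quad\text{and}\quad \sqrt{g} = h_1 h_2 h_3,$$ (7.6.54)

we have

$$\nabla^2\psi = \frac{1}{h_1 h_2 h_3} \left(\frac{\partial}{\partial q_1}\, h_1 h_2 h_3\, \frac{1}{h_1^2}\, \frac{\partial\psi}{\partial q_1} + \cdots\right).$$ (7.6.55)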
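For concreteness, the orthogonal-coordinate formula can be specialized to spherical coordinates. The identification $q_1 = r$, $q_2 = \theta$, $q_3 = \varphi$ with scale factors $h_1 = 1$, $h_2 = r$, $h_3 = r\sin\theta$ is the assumption made in this worked sketch:

```latex
\sqrt{g} = h_1 h_2 h_3 = r^2 \sin\theta,
\\[4pt]
\nabla^2\psi
= \frac{1}{r^2\sin\theta}\left[
    \frac{\partial}{\partial r}\left(r^2\sin\theta\,\frac{\partial\psi}{\partial r}\right)
  + \frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial\psi}{\partial\theta}\right)
  + \frac{\partial}{\partial\varphi}\left(\frac{1}{\sin\theta}\,\frac{\partial\psi}{\partial\varphi}\right)
  \right]
\\[4pt]
= \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\,\frac{\partial\psi}{\partial r}\right)
  + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial\psi}{\partial\theta}\right)
  + \frac{1}{r^2\sin^2\theta}\frac{\partial^2\psi}{\partial\varphi^2} .
```

Each term is $\sqrt{g}\, g^{\mu\mu} = h_1 h_2 h_3 / h_\mu^2$ times the corresponding derivative, differentiated once more and divided by $\sqrt{g}$.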


7.7. LOCAL TRANSFORMATION TO A 
PSEUDO-EUCLIDEAN SYSTEM 


We will now show that any symmetric tensor that is analytic in the
neighborhood of a point can be transformed to pseudo-Euclidean form at that
point. Let $g_{\mu\nu}$ be that tensor and $x_0 = 0$ that point. The theorem states
that we can find a coordinate system in which $g_{\mu\nu}(x_0) = \eta_{\mu\nu}$ and
$\partial g_{\mu\nu}/\partial x^\lambda |_{x = x_0} = 0$. Here, $\eta_{\mu\nu}$ is the appropriate pseudo-Euclidean tensor,
in that it must have the same number of positive and negative eigenvalues
(all $\pm 1$ in this case) as $g_{\mu\nu}$. As was noted earlier, this shows that the
existence of a coordinate system in which the tensor $g_{\mu\nu} = \eta_{\mu\nu}$ everywhere
is not necessary for the arguments of Section 7.6. We start in a coordinate
system with metric $g_{\mu\nu}(x)$. In the neighborhood of $x = 0$,

$$g_{\mu\nu} = g_{\mu\nu}(0) + \partial_\lambda g_{\mu\nu}(0)\, x^\lambda.$$ (7.7.1)
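We transform to a new coordinate system

$$x^\mu = a^\mu{}_\nu\, x'^\nu + \frac{b^\mu{}_{\nu\lambda}\, x'^\nu x'^\lambda}{2} \quad\text{and}\quad x'^\mu = \bar a^\mu{}_\nu\, x^\nu + \frac{\bar b^\mu{}_{\nu\lambda}\, x^\nu x^\lambda}{2}$$ (7.7.2)

for $x$ and $x'$ close to zero. The equation for $g'_{\mu\nu}$ is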


$$g'_{\mu\nu} = \frac{\partial x^\lambda}{\partial x'^\mu}\, \frac{\partial x^\sigma}{\partial x'^\nu}\, g_{\lambda\sigma}$$

or

$$g'_{\mu\nu} = (a^\lambda{}_\mu + b^\lambda{}_{\mu\alpha}\, x'^\alpha)(a^\sigma{}_\nu + b^\sigma{}_{\nu\beta}\, x'^\beta)\,[g_{\lambda\sigma}(0) + \partial_\rho g_{\lambda\sigma}(0)\, a^\rho{}_\tau\, x'^\tau].$$ (7.7.3)

We must now solve for $a$'s and $b$'s that make $g'_{\mu\nu}(0) = \eta_{\mu\nu}$ and
$\partial g'_{\mu\nu}/\partial x'^\lambda = 0$. The first condition specifies 10 values of $g'_{\mu\nu}(0)$, and there
are 16 $a^\mu{}_\nu$'s. Thus, six parameters are left over, corresponding to the six
parameters of a Lorentz transformation, which are undetermined, since
$\eta_{\mu\nu}$ is left invariant by a Lorentz transformation. There are 40 $\partial_\lambda g'_{\mu\nu}$'s
and 40 $b^\lambda{}_{\mu\nu}$'s, so that there are just enough conditions to determine the
local transformation $\partial x'/\partial x$ up to a Lorentz transformation. We must still
show that the equations have a solution. We divide the proof into three
parts. First, we diagonalize $g_{\mu\nu}$ at $x = 0$. Since $g_{\mu\nu}$ is a real, symmetric
tensor, we can diagonalize it by an orthogonal transformation. An
orthogonal transformation can always be written as $e^A$, where $A$ is real and
antisymmetric; hence, there are six independent parameters that are
determined by this process. The metric will now locally be of the form
$g_{\mu\nu} = \delta_{\mu\nu}\, \lambda^{(\mu)}$, where the $\lambda^{(\mu)}$ are the eigenvalues of the original matrix.
We can now reduce the $\lambda$'s to $\pm 1$ by a scale change, determining four
more parameters. It must be the case that three of the eigenvalues of $g$
are negative and one positive; otherwise, we cannot transform to $\eta_{\mu\nu}$
without a singular transformation.

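The remaining 40 equations are expanded to first order in $x'$, with
$a^\mu{}_\nu = \delta^\mu_\nu$ and $g_{\lambda\sigma}(0) = \eta_{\lambda\sigma}$:

$$g'_{\mu\nu} = \eta_{\mu\nu} + x'^\rho\, (b^\lambda{}_{\mu\rho}\, \eta_{\lambda\nu} + b^\sigma{}_{\nu\rho}\, \eta_{\mu\sigma} + \partial_\rho g_{\mu\nu}(0)),$$ (7.7.4)

and therefore,

$$b^\lambda{}_{\mu\rho}\, \eta_{\lambda\nu} + b^\sigma{}_{\nu\rho}\, \eta_{\mu\sigma} + \partial_\rho g_{\mu\nu} = 0.$$ (7.7.5)

The solution to Eq. (7.7.5) is easily seen to be [repeating the work leading
to (7.6.40)]

$$b^\lambda{}_{\mu\rho} = -\frac{\eta^{\lambda\sigma}}{2}\, (\partial_\rho g_{\sigma\mu} + \partial_\mu g_{\sigma\rho} - \partial_\sigma g_{\mu\rho}).$$ (7.7.6)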

If we try to go to one higher power of $x$, that is, make all second
derivatives vanish, we will, in general, fail. The number of conditions
is now the vanishing of $\partial_\lambda \partial_\sigma g_{\mu\nu}$, or 10 second derivatives of a
10-component tensor; thus, 100 conditions. The coordinate transformation
now includes a term cubic in $x'$, with coefficients symmetric in their three
lower indices; the available number of transformation parameters is
four (for $\mu = 0, 1, 2, 3$) times the number of components of a symmetric
tensor of rank $\ell$ (with $\ell = 3$) in four dimensions. This number is calculated
in Appendix B. It is given by (B.2.7):

$$S(\ell, 4) = \sum_{n_1 = 0}^{\ell}\ \sum_{n_2 = 0}^{\ell - n_1}\ \sum_{n_3 = 0}^{\ell - n_1 - n_2} 1$$ (7.7.7)

$$= \frac{(\ell + 1)(\ell + 2)(\ell + 3)}{6}$$ (7.7.8)

and for $\ell = 3$, $S(3, 4) = 20$. Thus, there are $4 \times 20 = 80$ adjustable
parameters in the transformation, and $10 \times 10 = 100$ equations to satisfy. This
leaves 20 combinations of second derivatives that cannot, in general, be
set equal to zero. As we shall see in the next chapter, this is just the
number of components of the curvature tensor in four dimensions. (See
also Problem 8.6.)

In three dimensions the number of nonzero second derivatives is six;
in two it is one, and in one dimension it is zero. In each case, this number
is the number of components of the curvature tensor.

We see that we cannot, in general, carry out a local coordinate
transformation to a pseudo-Euclidean coordinate system, up to vanishing
second derivatives of $g_{\mu\nu}$, although we can do so, up to vanishing of all first
derivatives of $g_{\mu\nu}$.
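The counting argument is easy to repeat for any dimension. In the sketch below, the closed form $n^2(n^2 - 1)/12$ for the number of independent curvature components is a standard result quoted here for comparison rather than derived in this section, and the function names are hypothetical.

```python
from math import comb


def symmetric_components(rank, dim):
    """Independent components of a totally symmetric tensor of the given rank in dim dimensions."""
    return comb(dim + rank - 1, rank)


def leftover_second_derivatives(dim):
    """Conditions on second derivatives of the metric minus available cubic coefficients."""
    conditions = symmetric_components(2, dim) ** 2    # symmetric pair of derivatives x symmetric metric
    parameters = dim * symmetric_components(3, dim)   # one symmetric rank-3 set per coordinate
    return conditions - parameters


def curvature_components(dim):
    """Standard count of independent curvature-tensor components, n^2 (n^2 - 1) / 12."""
    return dim * dim * (dim * dim - 1) // 12


if __name__ == "__main__":
    for dim in (1, 2, 3, 4):
        print(dim, leftover_second_derivatives(dim), curvature_components(dim))
```

For four dimensions the function reproduces the 100 conditions and 80 parameters of the text, and the leftover count matches the curvature count in each of the dimensions mentioned.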

The metric tensor can therefore inform us of intrinsic properties of
the space: for example, as just seen, the impossibility of finding a
coordinate system for which the metric is pseudo-Euclidean. We recall here two
examples with which the reader is surely familiar: an invariant interval

$$(ds)^2 = (dr)^2 + r^2 (d\theta)^2 + r^2 \sin^2\theta\, (d\varphi)^2,$$ (7.7.9)

corresponding to a diagonal metric tensor

$$g_{rr} = 1, \quad g_{\theta\theta} = r^2, \quad g_{\varphi\varphi} = r^2 \sin^2\theta,$$ (7.7.10)

will permit a transformation

$$z = r\cos\theta, \quad x = r\sin\theta\cos\varphi, \quad y = r\sin\theta\sin\varphi$$ (7.7.11)

which expresses $(ds)^2$ as

$$(ds)^2 = (dx)^2 + (dy)^2 + (dz)^2$$ (7.7.12)

for all $r$, $\theta$, $\varphi$.

A space of two variables, $\theta$ and $\varphi$, with $0 \le \theta \le \pi$ and $0 \le \varphi \le 2\pi$,
and invariant interval

$$ds^2 = R^2\,[(d\theta)^2 + \sin^2\theta\, (d\varphi)^2]$$ (7.7.13)

will permit no such transformation. Of course, (7.7.13) describes a
spherical surface embedded in three-dimensional Euclidean space. We will see
later, when we discuss curvature, precisely how the metric tensor
determines intrinsic geometry.
7.8. ALTERNATIVE CONSTRUCTION OF A
COVARIANTLY CONSERVED, SYMMETRIC 
STRESS-ENERGY TENSOR 


The method takes advantage of the possibility of writing a
Lorentz-invariant action in a generally invariant form. Thus, we introduce a
symmetric tensor $g_{\mu\nu}$ and rewrite the special relativistic action with the
substitutions

$$d^4x \to \sqrt{g}\, d^4x$$ (7.8.1)

$$A_{\mu,\nu} \to A_{\mu;\nu},$$ (7.8.2)

etc.

The action $S$ is now a general invariant, so a general coordinate
transformation does not change it. The main point is then the following:
A general, but infinitesimal coordinate transformation changes all the
dynamical variables. However, provided $y_p^\mu$, $A_\mu$, etc. satisfy the Lagrange
equations, the infinitesimal variations $\delta y_p^\mu$, $\delta A_\mu$, etc. will leave the action
invariant. Therefore, the only interesting consequence of the
transformation is the change of $g_{\mu\nu}$, which is not a Lagrangian variable; this change
alone must therefore leave $S$ invariant. We shall see that the statement
$\delta S = 0$ is equivalent to the conservation of a specific symmetric tensor
$T^{\mu\nu}$.

The algorithm is the following. Write $S$ in generally invariant form;
then make an infinitesimal variation in $g_{\mu\nu}$, $g_{\mu\nu} \to g_{\mu\nu} + \delta g_{\mu\nu}$. $T^{\mu\nu}$ is given
by the equation

$$\delta S = -\frac{1}{2} \int d^4x\, \sqrt{g}\, T^{\mu\nu}\, \delta g_{\mu\nu},$$ (7.8.3)

which obviously defines a symmetric tensor. We will show that $T^{\mu\nu}$ is
conserved:

$$T^{\mu\nu}{}_{;\nu} = 0$$ (7.8.4)

is a correct equation with $;\nu$ representing the covariant divergence. All
we need for the special relativistic case is this equation with $g_{\mu\nu}$ set equal
to $\eta_{\mu\nu}$, where (7.8.4) would become

$$\partial_\nu T^{\mu\nu} = 0.$$ (7.8.5)

Note that (7.8.3) would change sign were we to choose the metric $\eta_{\mu\nu}$
with positive space components instead of a positive time component.
Consider as an example the action for charged particles with charge
$q_p$ interacting with the electromagnetic field. We know the action

$$S = -\sum_p \frac{m_p}{2} \int \frac{dx_p^\mu}{d\sigma_p}\, \eta_{\mu\nu}\, \frac{dx_p^\nu}{d\sigma_p}\, d\sigma_p - \frac{1}{16\pi} \int d^4x\, F_{\mu\nu} F^{\mu\nu} + \sum_p q_p \int \frac{dy_p^\mu}{d\sigma_p}\, A_\mu(y_p)\, d\sigma_p,$$ (7.8.6)

where $\sigma_p$ is an invariant parameter associated with the $p$th charged
particle. We remember that this choice of $S$ is not reparametrization-invariant,
so that $d\sigma_p$ will be determined by the equation of motion. It is proportional
to $d\tau_p$; choosing $d\sigma_p = d\tau_p$ makes $m_p$ the observed particle mass.

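We can easily write a generally covariant form for (7.8.6):

$$S = -\sum_p \frac{m_p}{2} \int d\sigma_p\, g_{\mu\nu}(x_p)\, \frac{dx_p^\mu}{d\sigma_p}\, \frac{dx_p^\nu}{d\sigma_p} - \frac{1}{16\pi} \int d^4x\, \sqrt{g}\, g^{\mu\lambda} g^{\nu\sigma} F_{\mu\nu} F_{\lambda\sigma} + \sum_p q_p \int d\sigma_p\, \frac{dy_p^\mu}{d\sigma_p}\, A_\mu(y_p).$$ (7.8.7)

We make a small change in $g_{\mu\nu}$:

$$g_{\mu\nu} \to g_{\mu\nu} + \delta g_{\mu\nu}$$ (7.8.8)

and from

$$\delta(g^{\mu\lambda} g_{\lambda\nu}) = \delta(\delta^\mu_\nu) = 0,$$

observe

$$\delta g^{\mu\nu} = -g^{\mu\lambda}\, \delta g_{\lambda\sigma}\, g^{\sigma\nu}$$ (7.8.9)

and recall from (7.6.49)

$$\frac{\delta g}{g} = g^{\mu\nu}\, \delta g_{\mu\nu}.$$ (7.8.10)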

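The result is

$$\delta S = -\sum_p \frac{m_p}{2} \int d\sigma_p \int d^4x\, \delta^4(x - y_p(\sigma_p))\, \frac{dx_p^\mu}{d\sigma_p}\, \frac{dx_p^\nu}{d\sigma_p}\, \delta g_{\mu\nu}(x)$$

$$- \frac{1}{32\pi} \int d^4x\, \sqrt{g}\, \delta g_{\mu\nu}\, g^{\mu\nu} F_{\alpha\beta} F^{\alpha\beta} + \frac{1}{16\pi} \int d^4x\, \sqrt{g}\, \delta g_{\mu\nu}\, (F^{\mu\lambda} F^\nu{}_\lambda + F^{\lambda\mu} F_\lambda{}^\nu),$$ (7.8.11)

so that, setting $g^{\mu\nu} = \eta^{\mu\nu}$, we find

$$T^{\mu\nu} = \sum_p m_p \int d\sigma_p\, \frac{dx_p^\mu}{d\sigma_p}\, \frac{dx_p^\nu}{d\sigma_p}\, \delta^4(x - y_p(\sigma_p)) + \frac{1}{8\pi} \left[\frac{1}{2}\, \eta^{\mu\nu} F_{\alpha\beta} F^{\alpha\beta} - F^{\mu\lambda} F^\nu{}_\lambda - F^{\lambda\mu} F_\lambda{}^\nu\right].$$ (7.8.12)

Note that the last two terms are equal and symmetric in $\mu$ and $\nu$. With
$d\sigma_p = d\tau_p$,

$$\int d\sigma_p\, \delta^4(x - y_p(\sigma_p)) = \int d\tau\, \delta^4(x - y_p(\tau)) = \frac{d\tau}{dt}\, \delta^3(\mathbf{x} - \mathbf{y}_p) = \sqrt{1 - v_p^2}\; \delta^3(\mathbf{x} - \mathbf{y}_p)$$ (7.8.13)

so that

$$T^{\mu\nu} = \sum_p m_p \sqrt{1 - v_p^2}\, \frac{dy_p^\mu}{d\tau}\, \frac{dy_p^\nu}{d\tau}\, \delta^3(\mathbf{x} - \mathbf{y}_p) + \frac{1}{4\pi} \left[\frac{\eta^{\mu\nu}}{4}\, F_{\alpha\beta} F^{\alpha\beta} - F^{\mu\lambda} F^\nu{}_\lambda\right],$$ (7.8.14)

in agreement with our earlier result (7.5.23).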
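We call attention to an important property of (7.8.14), the vanishing
trace of the electromagnetic stress tensor:

$$T_\mu{}^\mu = \sum_p m_p \sqrt{1 - v_p^2}\; \delta^3(\mathbf{x} - \mathbf{y}_p) + \frac{1}{4\pi}\, (F_{\alpha\beta} F^{\alpha\beta} - F_{\mu\lambda} F^{\mu\lambda})$$

$$= \sum_p m_p \sqrt{1 - v_p^2}\; \delta^3(\mathbf{x} - \mathbf{y}_p).$$ (7.8.15)

Note also that the particle contribution to (7.8.15) can be written as

$$\sum_p \frac{m_p^2}{E_p}\, \delta^3(\mathbf{x} - \mathbf{y}_p)$$

and therefore vanishes like $m_p^2$, as $m_p \to 0$, $E_p$ remaining finite. In the case
of a massive vector field, with a term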


2: 
bf = Awd 


added to the Lagrangian density, the trace of the vector field stress tensor 
no longer vanishes. 
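The tracelessness of the field part of the stress tensor, and its energy density, can be checked numerically. The sketch below assumes Gaussian units with $c = 1$, the metric diag(+1, -1, -1, -1), and the component convention $F^{i0} = E_i$, $F^{ij} = -\epsilon_{ijk} B_k$; that convention and the function names are assumptions of this illustration, and the trace result does not depend on it.

```python
import math

ETA = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]  # diag(+,-,-,-)


def field_tensor(E, B):
    """Contravariant F^{mu nu}; assumed convention F^{i0} = E_i, F^{ij} = -eps_{ijk} B_k."""
    F = [[0.0] * 4 for _ in range(4)]
    for i in range(3):
        F[i + 1][0] = E[i]
        F[0][i + 1] = -E[i]
    F[1][2], F[2][1] = -B[2], B[2]
    F[2][3], F[3][2] = -B[0], B[0]
    F[3][1], F[1][3] = -B[1], B[1]
    return F


def em_stress(E, B):
    """T^{mu nu} = (1/4 pi) [ (eta^{mu nu}/4) F_ab F^ab - F^{mu l} F^{nu}_l ]."""
    F = field_tensor(E, B)
    # eta is diagonal, so F_ab F^ab = sum eta_aa eta_bb (F^ab)^2
    invariant = sum(ETA[a][a] * ETA[b][b] * F[a][b] ** 2 for a in range(4) for b in range(4))
    T = [[0.0] * 4 for _ in range(4)]
    for m in range(4):
        for n in range(4):
            ff = sum(F[m][l] * F[n][l] * ETA[l][l] for l in range(4))  # F^{m l} F^{n}_l
            T[m][n] = (0.25 * ETA[m][n] * invariant - ff) / (4 * math.pi)
    return T


def trace(T):
    """T_mu^mu = eta_{mu nu} T^{mu nu}."""
    return sum(ETA[m][m] * T[m][m] for m in range(4))


if __name__ == "__main__":
    E, B = [0.3, -1.2, 0.5], [2.0, 0.1, -0.7]
    T = em_stress(E, B)
    print("trace:", trace(T))
    print("T^00:", T[0][0], "vs (E^2 + B^2)/8 pi:",
          (sum(e * e for e in E) + sum(b * b for b in B)) / (8 * math.pi))
```

The trace vanishes for any antisymmetric $F^{\mu\nu}$ because the first term contributes $F_{\alpha\beta} F^{\alpha\beta}$ and the second subtracts the same quantity.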
We show that (7.8.3) defines a conserved $T^{\mu\nu}$. Assume

$$S = \int d^4x\, \sqrt{g(x)}\; \mathcal{L}(\psi_a(x), x, y_p)$$ (7.8.16)

where $\mathcal{L}$ is an invariant density and $\psi_a$ are the dynamical variables in the
Lagrangian, including the metric $g_{\mu\nu}$. The meaning of the $\psi_a$ is clear
enough for $A_\mu$, $g_{\mu\nu}$, etc.; for the particle variables, we would write

$$S_p = -\frac{m_p}{2} \int d\sigma_p\, \frac{dy_p^\mu}{d\sigma_p}\, \frac{dy_p^\nu}{d\sigma_p}\, g_{\mu\nu}(y_p)$$

$$= -\frac{m_p}{2} \int d^4x\, \sqrt{g}\, d\sigma_p\, \frac{dy_p^\mu}{d\sigma_p}\, \frac{dy_p^\nu}{d\sigma_p}\, g_{\mu\nu}(x)\, \frac{\delta^4(x - y_p)}{\sqrt{g}}.$$ (7.8.17)


The action $S$ is invariant under a general coordinate transformation
$x^\mu = x^\mu(x')$, under which

$$d^4x \to d^4x', \quad g(x) \to g'(x'), \quad A_\mu(x) \to A'_\mu(x'),$$

etc., so

$$S = \int d^4x'\, \sqrt{g'(x')}\; \mathcal{L}(A'_\mu(x'), g'_{\mu\nu}(x'), x', y'_p).$$ (7.8.18)

The integration variable $x'$ in (7.8.18) can be changed to $x$ without
changing the value of $S$. Thus, in one dimension,

$$\int dx'\, f(x') = \int dx\, f(x)$$ (7.8.19)

with $x'$ and $x$ single-valued functions of each other with the same end
points, and $f$ an arbitrary function. Similarly,


= [ daves Ww. (7.8.20) 


The superscript Z” instructs us to calculate Z as a function of the transfor- 
med field, but with x’ set equal to x. For example, if ẹ is a scalar field 
and $x = x' + \delta x$, then

$$\psi'(x') = \psi(x)$$

and

$$\psi'(x) = \psi'(x') + \psi'(x) - \psi'(x') = \psi(x) + \frac{\partial\psi}{\partial x^\mu}\, \delta x^\mu$$ (7.8.21)

to first order in $\delta x$; similar definitions hold for the other variables.

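We now return to (7.8.20). As shown above, we replace $x'$ by $x$,
which leaves $S$ unchanged. Next, we expand $S$ in the first-order changes
in $A_\mu$, $y_p$, etc. Since $A_\mu$, $y_p$, etc. obey the Lagrangian equations, their
first-order changes leave $S$ invariant. The only change that could affect $S$
is in $g_{\mu\nu}$, with

$$g'_{\mu\nu}(x) = g'_{\mu\nu}(x') + g'_{\mu\nu}(x) - g'_{\mu\nu}(x')$$ (7.8.22)

$$= \frac{\partial x^\lambda}{\partial x'^\mu}\, \frac{\partial x^\sigma}{\partial x'^\nu}\, g_{\lambda\sigma} + \left(\frac{\partial}{\partial x^\lambda}\, g_{\mu\nu}\right) \delta x^\lambda$$ (7.8.23)

$$= \left(\delta^\lambda_\mu + \frac{\partial\, \delta x^\lambda}{\partial x^\mu}\right) \left(\delta^\sigma_\nu + \frac{\partial\, \delta x^\sigma}{\partial x^\nu}\right) g_{\lambda\sigma} + \delta x^\lambda\, \partial_\lambda g_{\mu\nu}$$

$$= g_{\mu\nu} + \frac{\partial\, \delta x^\lambda}{\partial x^\mu}\, g_{\lambda\nu} + \frac{\partial\, \delta x^\lambda}{\partial x^\nu}\, g_{\mu\lambda} + \delta x^\lambda\, \partial_\lambda g_{\mu\nu},$$ (7.8.24)

still accurate to first order in $\delta x$. However, this transformation does not
affect $S$, since $S$ is invariant. Thus, we have from the definition (7.8.3)

$$\delta S = -\frac{1}{2} \int d^4x\, \sqrt{g}\, T^{\mu\nu}\, \delta g_{\mu\nu}$$ (7.8.25)

$$= -\frac{1}{2} \int d^4x\, \sqrt{g}\, T^{\mu\nu} \left(\frac{\partial\, \delta x^\lambda}{\partial x^\mu}\, g_{\lambda\nu} + \frac{\partial\, \delta x^\lambda}{\partial x^\nu}\, g_{\mu\lambda} + \delta x^\lambda\, \frac{\partial}{\partial x^\lambda}\, g_{\mu\nu}\right)$$

$$= \frac{1}{2} \int d^4x\, \delta x^\lambda \left\{\partial_\mu(g_{\lambda\nu} \sqrt{g}\, T^{\mu\nu}) + \partial_\nu(g_{\mu\lambda} \sqrt{g}\, T^{\mu\nu}) - (\partial_\lambda g_{\mu\nu})\, \sqrt{g}\, T^{\mu\nu}\right\}.$$ (7.8.26)

Since the integral in (7.8.26) is generally invariant, (7.8.26) can be
written as

$$0 = \int d^4x\, \sqrt{g}\; \delta x^\lambda\, Q_\lambda,$$ (7.8.27)

where $Q_\lambda$ is a vector under general coordinate transformations. Since
locally, in the $\xi$ system, $Q_\lambda = (\partial/\partial\xi^\mu)\, T^\mu{}_\lambda$, $Q_\lambda$ must be the vector:

$$Q_\lambda = T^\mu{}_{\lambda;\mu}.$$ (7.8.28)

Equation (7.8.27) then shows that $T^\mu{}_\lambda$, and hence $T^{\mu\nu}$, is conserved.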


CHAPTER 7 PROBLEMS 


7.1 Consider a particle of mass $m$ moving in an external scalar potential
$\psi$. Consider the action

$$S = \int L\, dt$$

with $L = -(m + g\psi)\sqrt{1 - v^2}$ and $g$ a dimensionless coupling
constant.

(a) Derive the equations of motion for $\mathbf{x}(t)$.

(b) Show that they form the space components of a consistent
four-vector equation.

7.2 Consider next the action

$$S = \int L'\, d\sigma$$

with $L' = -(m + g\psi)\, \frac{dy^\mu}{d\sigma} \frac{dy_\mu}{d\sigma}$ and $\sigma$ an invariant parameter
replacing time.

(a) Derive the equations of motion for $y^\mu(\sigma)$.

(b) Show that $(m + g\psi)\, \frac{dy^\mu}{d\sigma} \frac{dy_\mu}{d\sigma}$ is constant and that the
constant may be freely chosen without affecting the equations of
motion.

(c) Compare the equations derived from $L'$ with those derived from
$L$ in Problem 7.1 and show that there are no scale changes of $g$
and $m$ that bring them into agreement. Thus, if we assume that
we complete the scalar theory with the usual field Lagrangian for
$\psi$, the two Lagrangians $L$ and $L'$ lead to different theories.

(d) Show, however, that there is a nonlinear transformation of the
field $\psi$ in the Lagrangian $L'$ that will bring it into agreement with
the Lagrangian $L$. This shows that with a given external field $\psi$,
one can use either Lagrangian for determining the particle
motion, providing one makes the appropriate transformation of $\psi$.
*7.3 Complete the construction of a scalar field theory outlined in the
text: (7.4.22-7.4.27).

(a) Derive the Lagrangian equations for the field $\psi(x)$ and the
particle coordinates $y^\mu$.

(b) In the non-relativistic limit ($g\psi/m \ll 1$, $\partial\psi/\partial t \ll |\nabla\psi|$, $d\sigma = dy^0$),
show that the interparticle force is attractive between like
particles. (Do not try to calculate the force of the particle on itself.)

7.4 Verify that $\Theta^\mu{}_\nu$ as given by (7.4.28) is conserved, $\partial_\mu \Theta^\mu{}_\nu = 0$, and
$P_\nu = \int \Theta^0{}_\nu\, d^3x$ is correctly given by (7.4.21).

*7.5 From the action for an electromagnetic field interacting with a
charged particle,


1 dy” m dy” dy 
galg aria fa ay 4 al ay Uu 
| Yp A a A V o 


where $A_\mu(x)$ and $y^\mu$ are the dynamical variables:

(a) Construct the canonical stress tensor $\Theta^{\mu\nu}$ given by (7.4.29).

(b) Construct from it the symmetric stress tensor $T^{\mu\nu}$ as defined by
(7.4.37).

(c) Show that in the $\mu^2 = 0$ case the energy $P^0 = \int T^{00}\, d^3x$ is given
by (7.5.24).

*7.6 Consider a Lagrangian density $\mathcal{L}$ that is a function of a set of fields
$\psi_a(x)$ and their first and second space-time derivatives, $\partial_\mu \psi_a(x)$ and
$\partial_\mu \partial_\nu \psi_a(x)$.

(a) Derive the Lagrange equations for this case.

(b) Assuming that $\mathcal{L}$ has no explicit $x$ dependence, apply the Noether
procedure [as in (7.4.9-7.4.17)] to find an expression for the
energy-momentum four-vector $P^\alpha$.

(c) $P^\alpha$ has the form

$$P^\alpha = \int d^3x\, \Theta^{0\alpha}$$

where $\Theta^{0\alpha}$ is the 0-$\alpha$ component of a tensor $\Theta^{\mu\alpha}$. Construct the
tensor $\Theta^{\mu\alpha}$ and show that it is locally conserved: $\partial_\mu \Theta^{\mu\alpha} = 0$.

(d) Construct a single invariant Lagrangian of the form $\partial_\mu \Lambda^\mu$, with
$\Lambda^\mu$ a function of a single scalar field $\psi$ and its first derivatives.
Imagine that this function is added to an existing Lagrangian.
Verify that the Lagrange equations are unchanged by the
addition. Show that the tensor $\Theta^{\mu\nu}$ is changed, symmetric, and
conserved, but does not contribute a change to the energy-momentum
vector $P^\alpha$. (The fact that $\Theta^{\mu\nu}$ is symmetric will not, in
general, be true for more complex theories.)


*7.7 A Lagrangian density $\mathcal{L}$ is changed by adding a $\Delta\mathcal{L} = \partial_\mu \Lambda^\mu(\psi_a, \partial_\nu \psi_a)$,
where the $\psi_a$ are the set of fields described by the Lagrangian.

In order to simplify the problem, choose $\Lambda^\mu$ so that the $\Delta\mathcal{L}$ (as
well as $\mathcal{L}$) depends only on the fields and their first derivatives.

(a) Find the condition on $\Lambda^\mu$ such that $\Delta\mathcal{L}$ not depend on second
derivatives.

(b) $\Lambda^\mu$ (and $\mathcal{L}$) are taken to be independent of $x^\mu$ except through
the $x^\mu$ dependence of the fields. Show that the Lagrange
equations for the fields are unchanged by the addition of $\Delta\mathcal{L}$.

(c) Construct the change in the canonical stress-energy tensor $\Delta\Theta^{\mu\nu}$
and show that it is conserved (but not zero).

(d) Show, however, that the change in the energy-momentum
four-vector

$$\Delta P^\mu = \int d^3x\, \Delta\Theta^{0\mu}$$

is zero.


*7.8 The action that yields the model of Problem 3.2 is 


S= [erag fano 


where 


#= 500+ A} (=A) -dea D ~ ¥- AC, ofa- 9) 


and L, = 5 my’. 
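Define the field Lagrangian $L = \int \mathcal{L}\, d^3x$.

(a) Show that the above action yields the equations of Problem 3.2.

(b) Show that, provided $f(\mathbf{x} - \mathbf{y}) = f(|\mathbf{x} - \mathbf{y}|)$, the Lagrangian $L + L_p$
is invariant under a rotation; that is, a transformation

$$\varphi'(\mathbf{x}') = \varphi(\mathbf{x})$$

where to first order in $\boldsymbol{\epsilon}$

$$\mathbf{x}' = \mathbf{x} + \boldsymbol{\epsilon} \times \mathbf{x}$$

so

$$\varphi' = \varphi - \boldsymbol{\epsilon} \times \mathbf{x} \cdot \nabla\varphi$$

$$\mathbf{A}' = \mathbf{A} + \boldsymbol{\epsilon} \times \mathbf{A} - \boldsymbol{\epsilon} \times \mathbf{x} \cdot \nabla\mathbf{A}$$

and

$$\mathbf{y}' = \mathbf{y} + \boldsymbol{\epsilon} \times \mathbf{y}.$$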

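(c) Use Noether's theorem to construct the conservation law
emerging from this symmetry. The conserved quantity $\delta Q = \boldsymbol{\epsilon} \cdot \mathbf{L}$; the
axial vector $\mathbf{L}$ is a conserved angular momentum.

(d) Show that $\mathbf{L} = \mathbf{L}_{em} + \mathbf{L}_{mech} + \mathbf{L}_{int}$, where $\mathbf{L}_{em}$ is the
electromagnetic angular momentum:

$$\mathbf{L}_{em} = \frac{1}{4\pi} \int d^3r\, \mathbf{r} \times (\mathbf{E} \times \mathbf{B}),$$

$\mathbf{L}_{mech}$ the particle angular momentum:

$$\mathbf{L}_{mech} = \sum_i \mathbf{y}_i \times m \dot{\mathbf{y}}_i,$$

and $\mathbf{L}_{int}$ an "interaction" angular momentum: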
Lwa | v-e x Aste y) 


(e) Show that $\mathbf{L}_{int}$ is gauge-invariant [only, of course, when $f(\mathbf{r} - \mathbf{y})$
is invariant under rotations: $f = f(|\mathbf{r} - \mathbf{y}|)$].


*7.9 Show that the Lagrange density 
L= -5 (.A,0"AY ~ að, A"9,A") 


does not lead to a positive energy except for $a = 1$.

(a) Do this in two steps. First, with $a = 1$, show that the $A^0$ equation
is an equation of constraint: $\partial_i(\partial^i A^0 - \partial^0 A^i) = 0$.

(b) From this, using the technique that led to (7.5.19), show that
the energy integral $P^0 = \int \Theta^{00}\, d^3x$ is positive-definite.

(c) Now take $a \ne 1$. The equation for $A^0$ is no longer a constraint
equation, since it gives the $A^0$ in terms of the fields (potentials)
and their first derivatives. Show that the energy integral is now
unbounded in both positive and negative directions.


7.10 Consider a freely propagating electromagnetic field. Let the vector
potential have a well-behaved Fourier transform $a_\mu(k)$. $a_\mu(k)$ satisfies
the Maxwell equations

$$k^2 a_\mu - k_\mu k^\lambda a_\lambda = 0.$$

(a) Show from this that if $k^2 = k_0^2 - \mathbf{k}^2 \ne 0$, the field Fourier
transform $k_\mu a_\nu - k_\nu a_\mu$ is zero.

(b) If $k^2 = 0$, evidently $k^\lambda a_\lambda = 0$, and we are in a Lorentz gauge.
Clearly, the requirement that the Fourier transform for $A$ be
well behaved is a strong condition.


7.11 (a) Consider a flat two-dimensional space with polar coordinates
$x^1 = \rho$ and $x^2 = \varphi$, which we define to be contravariant. The
invariant distance squared is

$$(ds)^2 = (d\rho)^2 + \rho^2 (d\varphi)^2.$$

Find $g_{\mu\nu}$ ($\mu, \nu = 1, 2$), $g^{\mu\nu}$, and express the invariant
$\nabla^2\psi = (g^{\mu\nu}\partial_\nu\psi)_{;\mu}$ in terms of $\rho$ and derivatives of the scalar $\psi$ with
respect to $\rho$ and $\varphi$.


(b) What are the covariant coordinates $x_1$ and $x_2$?


(c) Repeat the exercise for spherical coordinates in three dimensions.
There

$$(ds)^2 = (dr)^2 + r^2 (d\theta)^2 + r^2 \sin^2\theta\, (d\varphi)^2.$$


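7.12 Consider a vector $\mathbf{V} = \mathbf{i}V_x + \mathbf{j}V_y + \mathbf{k}V_z$, whose spherical components
are $V_r$, $V_\theta$, and $V_\varphi$. That is,

$$\mathbf{V} = \hat{\mathbf{r}}V_r + \hat{\boldsymbol{\theta}}V_\theta + \hat{\boldsymbol{\varphi}}V_\varphi$$

where $\hat{\mathbf{r}}$, $\hat{\boldsymbol{\theta}}$, and $\hat{\boldsymbol{\varphi}}$ are unit vectors in the corresponding directions.


(a) Give $V_r$, $V_\theta$, and $V_\varphi$ in terms of the rectangular components of
$\mathbf{V}$ and $\mathbf{r}$.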


(b) Give the covariant components of V: V,, Ve, and V, in terms 
of V, Vo, Vp- 
(c) The same as (b), but give contravariant components. 


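*7.13 We consider here a pair of scalar fields, $\psi_1$ and $\psi_2$, with a Lagrangian

$$\mathcal{L}_\psi = \tfrac{1}{2}\partial_\mu\psi_1\partial^\mu\psi_1 + \tfrac{1}{2}\partial_\mu\psi_2\partial^\mu\psi_2 - V(\psi_1^2 + \psi_2^2).$$

We note the symmetry of $\mathcal{L}_\psi$ under a rotation-like mixing of $\psi_1$ and
$\psi_2$, that is,

$$\psi_1' = \psi_1\cos\alpha + \psi_2\sin\alpha, \qquad \psi_2' = -\psi_1\sin\alpha + \psi_2\cos\alpha.$$

The infinitesimal symmetry is

$$\delta\psi_1 = \delta\alpha\,\psi_2, \qquad \delta\psi_2 = -\delta\alpha\,\psi_1.$$

The Noether charge is

$$\delta Q = \int \left[\frac{\partial\mathcal{L}_\psi}{\partial(\partial_0\psi_1)}\,\delta\psi_1 + \frac{\partial\mathcal{L}_\psi}{\partial(\partial_0\psi_2)}\,\delta\psi_2\right] d^3x$$

which suggests a locally conserved current

$$J^\mu = (\psi_2\,\partial^\mu\psi_1 - \psi_1\,\partial^\mu\psi_2).$$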


A more convenient representation is obtained by introducing a 
complex field 


$$\psi = \frac{\psi_1 + i\psi_2}{\sqrt{2}}$$

and its complex conjugate

$$\psi^* = \frac{\psi_1 - i\psi_2}{\sqrt{2}},$$

in terms of which

$$\mathcal{L}_\psi = \partial_\mu\psi^*\,\partial^\mu\psi - V(\psi^*\psi),$$

and the invariance described above becomes the invariance of $\mathcal{L}_\psi$
under the phase transformation

$$\psi' = e^{-i\alpha}\psi$$

with real, constant $\alpha$.

(a) Show that the independent variation $\delta\psi_1$ and $\delta\psi_2$ can be replaced
by independent variation $\delta\psi$ and $\delta\psi^*$; then verify that the Lagrange
equations are the same.


(b) Find the Noether current arising from the phase invariance $\psi' = e^{-i\alpha}\psi$.

The phase transformation is “gauged” by permitting $\alpha$ to be
space-time-dependent and introducing a vector field $A_\mu$ into

$$\mathcal{L}_\psi = (\partial^\mu + ieA^\mu)\psi^*\,(\partial_\mu - ieA_\mu)\psi - V(\psi^*\psi). \tag{1}$$

Evidently, $\mathcal{L}_\psi$ is invariant under the combined transformations

$$\psi' = e^{ie\Lambda}\psi, \qquad \psi^{*\prime} = e^{-ie\Lambda}\psi^*, \qquad \text{and} \qquad A_\mu' = A_\mu + \partial_\mu\Lambda. \tag{2}$$


One must add the Lagrangian $\mathcal{L}_A$ of the vector field to have a
complete theory of the interaction of the $A$ and $\psi$ fields. Of course,
$\mathcal{L}_A$ must also be invariant under the gauge transformation of $A$ for
the symmetry to hold. However, independent of $\mathcal{L}_A$,


(c) There is a conserved current for the new $\mathcal{L}_\psi$. It is

$$j^\mu = \frac{\partial\mathcal{L}_\psi}{\partial A_\mu}. \tag{3}$$

Show by explicit calculation that this current is conserved for $\psi$
and $\psi^*$ satisfying their equations of motion.

(d) Show that the gauge invariance of the Lagrangian (1) implies the
conservation of the current given by (3). The general theorem is
the following: Given a Lagrangian $\mathcal{L}_\psi(\psi_a, A_\mu)$ that is invariant
under the transformation $\psi_a \to \psi_a + \delta\psi_a$ and $A_\mu \to A_\mu + \partial_\mu\Lambda$,
then $j^\mu = \partial\mathcal{L}_\psi/\partial A_\mu$ is a conserved current, that is, $\partial_\mu j^\mu = 0$,
provided the Lagrange equations are satisfied by the $\psi$ fields.
Hint: If the Lagrange equations for the $\psi_a$ are satisfied, the
action $S = \int d^4x\, \mathcal{L}_\psi$ is also invariant under the transformation
$\psi_a \to \psi_a + \delta\psi_a$ with no change in $A_\mu$.
CHAPTER 8


Gravity 


8.1. THE NATURE OF THE GRAVITATIONAL FIELD 


The essential phenomenology that leads to a theory of gravity was given 
by Newton: 


The force of gravity between two bodies is always attractive and proportional
to $m_1 m_2/r_{12}^2$, where $m_1$ and $m_2$ are the inertial masses of the two bodies and
$r_{12}$ the distance between them.


From our discussions of massive field theories, we recognize that the
gravitational field must be massless and presumably satisfies some equation
like

$$\nabla^2\phi - \frac{\partial^2\phi}{\partial t^2} = 4\pi\rho \tag{8.1.1}$$

where $\rho$ is the source density and $\phi$ some component of the gravitational
potential (for simplicity, we will call it the gravitational potential). The
asymptotic potential arising from a body with source density $\rho_1$ will be,
just as for a scalar field,

$$\phi_1 = -\frac{1}{r}\int d\mathbf{r}\, \rho_1(\mathbf{r}), \tag{8.1.2}$$

and the asymptotic interaction energy with body two will be

$$-\frac{1}{r_{12}}\int d\mathbf{r}_1\, \rho_1(\mathbf{r}_1) \int d\mathbf{r}_2\, \rho_2(\mathbf{r}_2) \tag{8.1.3}$$

with the minus sign for attraction. Therefore, we expect to have

$$\int d\mathbf{r}\, \rho_1(\mathbf{r}) = \lambda m_1, \qquad \int d\mathbf{r}\, \rho_2(\mathbf{r}) = \lambda m_2 \tag{8.1.4}$$

with $\lambda$ a universal constant.

The remarkable feature of (8.1.4), the equality (to within a choice
of units) of gravitational and inertial mass, is known to hold to very
high accuracy. All bodies at the same point in space fall with the same
acceleration to about one part in $10^{11}$. We will try to find a relativistic
field theory that accounts for the simple phenomenology outlined above.
We will be led to a theory that will turn out to be a linear approximation
to Einstein’s theory; when we try to make the theory internally consistent,
we will be led to the complete Einstein theory. This approach was initiated
by Feynman,¹ Gupta,² and Thirring.³

What kind of field can carry gravity? Clearly, a vector field is out of 
the question, since it generates a repulsive force between like particles. 

We have seen (Problem 7.3) that the force produced by a scalar field
is attractive, and therefore a scalar field is a candidate for the carrier of
gravity. However, it does not work. Note that the density $\rho(\mathbf{r})$ in (8.1.1)
must be proportional to energy density, since $\int \rho(\mathbf{r})\, d\mathbf{r} = \lambda m$. However,
the density that couples to a scalar field is a scalar source, which for a
point particle, we have seen, is

$$\rho_s(\mathbf{r}) = \sqrt{1 - v^2}\; \delta^3(\mathbf{r} - \mathbf{y}_p) \tag{8.1.5}$$

and $\int \rho_s(\mathbf{r})\, d\mathbf{r} = \sqrt{1 - v^2}$, which is not the energy. In contrast, the $T^{00}$
component of the free particle stress tensor precisely integrates to the
energy. From

$$T_p^{\mu\nu} = m \frac{dy^\mu}{d\tau}\frac{dy^\nu}{d\tau} \sqrt{1 - v^2}\; \delta^3(\mathbf{x} - \mathbf{y}_p) \tag{8.1.6}$$

we find

$$\int d\mathbf{r}\, T_p^{00} = \frac{m}{\sqrt{1 - v^2}}. \tag{8.1.7}$$


¹R. P. Feynman, 1962 CalTech lecture notes.
²S. N. Gupta, Phys. Rev. 96, 1683 (1954).
³W. E. Thirring, Ann. Phys. 16, 96 (1961).
For a collection of elementary particles forming a nucleus, atom, molecule,
planet, etc., the stress tensor $T^{\mu\nu}$ is guaranteed to give the energy via

$$W = \int T^{00}\, d\mathbf{r} \tag{8.1.8}$$

no matter how complex the system.⁴

How decisive is the failure of the scalar coupling to yield the inertial
mass? Suppose we consider a nucleus and take a scalar density
$\rho_s = m_p \sqrt{1 - v_p^2}\; \delta^3(\mathbf{r} - \mathbf{y}_p)$ for each particle. The integrated coupling of the
scalar field to the nucleus would be

$$M_s = \sum_p m_p \sqrt{1 - v_p^2} + (?) \tag{8.1.9}$$

where (?) stands for interactive effects, perhaps arising from other scalar
sources in the nucleus. There is no reason to expect that (?) would correct
the error in the factor $\sqrt{1 - v_p^2}$, which is $\sim m_p v_p^2/m_p$, or roughly the binding
energy of the nucleus over its rest energy, MeV over GeV, or $\sim 10^{-3}$,
maybe $10^{-4}$ or $10^{-5}$ with a little conspiratorial help, but huge compared
to the $10^{-11}$ equality of gravitational and inertial masses.

Another general way of seeing that there is difficulty is to suppose 
that the hypothetical scalar gravity is coupled to the natural scalar source 
density in a nucleus or atom, the trace of the “matter” stress tensor: 


$$T = T_\mu{}^\mu, \tag{8.1.10}$$


where ‘matter’ includes the electrons and quarks and the gluon and
electromagnetic fields. Here, we recall that the trace of the electromagnetic
stress tensor vanishes; therefore, its contribution to $T_\mu{}^\mu$ cannot match its
contribution to $T^{00}$, so we would have a correction to the equality of
gravitational and inertial mass of order 12/781, a few tenths of a 
percent or more. In addition, gravity would in first approximation not 
deflect light. 

It would clearly take a remarkable conspiracy to cancel out all these 
problems and restore the known equality of the two masses, gravitational 
and inertial. We must consider the pure scalar theory of gravity decisively 
ruled out. We turn, therefore, to the next simplest possibility, a symmetric 
tensor field $\phi_{\mu\nu}$.


⁴We here exclude internal gravitational energy.

8.2. THE TENSOR FIELD


We expect to find equations resembling

$$\partial_\lambda\partial^\lambda \phi_{\mu\nu} + \cdots = -\lambda T_{\mu\nu}, \tag{8.2.1}$$

where $T_{\mu\nu}$ is the matter stress tensor, and $+\cdots$ allows the addition of
terms like $\partial_\mu\partial^\lambda\phi_{\lambda\nu}$, etc., as required to give a positive energy. $\lambda$ is a
coupling constant; the minus sign is a convention.

As in our discussion of vector fields, we have two ways to proceed.
We can write the most general Lagrangian density that will give equations
like (8.2.1) (that means bilinear in $\phi_{\mu\nu}$ and $\partial_\lambda\phi_{\mu\nu}$) and adjust the
constants to give positive energy. This is possible, but difficult. It is much
simpler to repeat the process we used earlier, that is, to require that the
source $T_{\mu\nu}$ radiate only fields possessing in their characteristic coordinate
system (rest system for particles, wave number zero for fields) only the
five components associated with a three-dimensional symmetric traceless
second-rank tensor. As in Section 7.5, we work with the four-dimensional
Fourier transform of the field variables. In order to carry out this program,
we must start with a massive field. The equation will be

$$(\partial_\lambda\partial^\lambda + \mu^2)\, \phi^{\rho\nu} = -\lambda P^{\rho\nu}{}_{\lambda\sigma} T^{\lambda\sigma} \tag{8.2.2}$$

where $P$ is a projection operator that eliminates the unwanted components.

How do we eliminate the unwanted components of $\phi_{\rho\nu}$? In the
$k_i = 0$ coordinate system, we require that the three-dimensional scalars
and vectors that we can form from the tensor $\phi_{\rho\nu}$ all vanish:

$$\phi_{00} = 0, \qquad \phi_{i0} = \phi_{0i} = 0, \qquad \text{and} \qquad \sum_{i=1}^{3} \phi_{ii} = 0 \tag{8.2.3}$$

for a propagating mode, that is, for $k^2 = k_\lambda k^\lambda = \mu^2$. This elimination can
be expressed covariantly: the conditions

$$k^\rho \phi_{\rho\nu} = 0 \qquad \text{and} \qquad \phi_\rho{}^\rho = 0 \tag{8.2.4}$$

reduce to (8.2.3) when $k^i = 0$.

We have thus reduced the ten-component symmetric tensor (by
dropping the three-vector $\phi_{0i}$ and two three-scalars, $\phi_{00}$ and $\phi_{ii}$) to a
five-component traceless, symmetric three-tensor.

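We write the most general covariant operator $P^{\rho\nu}{}_{\lambda\sigma}$, symmetric in
$\rho\nu$ when operating via $P^{\rho\nu}{}_{\lambda\sigma} T^{\lambda\sigma}$ on a symmetric tensor, and satisfying
$k_\rho P^{\rho\nu}{}_{\lambda\sigma} = 0$ and $\eta_{\rho\nu} P^{\rho\nu}{}_{\lambda\sigma} = 0$ when $k^2 = \mu^2$.
The most general structure for $P$ is

$$P^{\rho\nu}{}_{\lambda\sigma} = \delta^\rho_\lambda \delta^\nu_\sigma + \frac{\alpha}{\mu^4} k^\rho k^\nu k_\lambda k_\sigma + \frac{\beta}{\mu^2} \eta^{\rho\nu} k_\lambda k_\sigma + \frac{\gamma}{\mu^2} k^\rho k^\nu \eta_{\lambda\sigma} + \xi\, \eta^{\rho\nu}\eta_{\lambda\sigma} + \frac{\epsilon}{\mu^2}\left(k^\rho k_\lambda \delta^\nu_\sigma + k^\nu k_\lambda \delta^\rho_\sigma\right) \tag{8.2.5}$$

where $\alpha$, $\beta$, $\gamma$, $\xi$, and $\epsilon$ are adjustable constants. We first trace $\rho\nu$,
setting $k^2 = \mu^2$:

$$\eta_{\rho\nu} P^{\rho\nu}{}_{\lambda\sigma} = \eta_{\lambda\sigma} + \frac{\alpha}{\mu^2} k_\lambda k_\sigma + \frac{4\beta}{\mu^2} k_\lambda k_\sigma + \gamma\, \eta_{\lambda\sigma} + 4\xi\, \eta_{\lambda\sigma} + \frac{2\epsilon}{\mu^2} k_\lambda k_\sigma = 0 \tag{8.2.6}$$

or

$$(1 + \gamma + 4\xi) = 0 \qquad \text{and} \qquad (\alpha + 4\beta + 2\epsilon) = 0. \tag{8.2.7}$$

Next, we multiply with $k_\rho$:

$$k_\rho P^{\rho\nu}{}_{\lambda\sigma} = k_\lambda \delta^\nu_\sigma + \frac{\alpha}{\mu^2} k^\nu k_\lambda k_\sigma + \frac{\beta}{\mu^2} k^\nu k_\lambda k_\sigma + \gamma\, k^\nu \eta_{\lambda\sigma} + \xi\, k^\nu \eta_{\lambda\sigma} + \epsilon\, k_\lambda \delta^\nu_\sigma + \frac{\epsilon}{\mu^2} k^\nu k_\lambda k_\sigma = 0 \tag{8.2.8}$$

so that

$$1 + \epsilon = 0, \qquad \alpha + \beta + \epsilon = 0, \qquad \text{and} \qquad \gamma + \xi = 0. \tag{8.2.9}$$

The solution of (8.2.7) and (8.2.9) is

$$\alpha = \frac{2}{3}, \qquad \beta = \frac{1}{3}, \qquad \gamma = \frac{1}{3}, \qquad \xi = -\frac{1}{3}, \qquad \epsilon = -1. \tag{8.2.10}$$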
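The two defining properties of the projector can be checked numerically. The following is a minimal sketch, assuming the metric signature (+,−,−,−), the constants of (8.2.10), and an arbitrarily chosen on-shell momentum with μ² = 1; the function name `project` and the sample tensor are illustrative, not the author's notation. Exact rational arithmetic is used so that the zeros are exact.

```python
from fractions import Fraction as F

ETA = (1, -1, -1, -1)  # diagonal of the Minkowski metric, assumed signature (+,-,-,-)
IDX = range(4)


def eta(r, n):
    """Metric component eta^{rn} (numerically equal to eta_{rn})."""
    return ETA[r] if r == n else 0


def project(T, k_up, mu2):
    """Apply P of (8.2.5), with the constants of (8.2.10), to a symmetric T^{lambda sigma}."""
    alpha, beta, gamma, xi, eps = F(2, 3), F(1, 3), F(1, 3), F(-1, 3), F(-1)
    k_dn = [ETA[m] * k_up[m] for m in IDX]
    trace = sum(ETA[m] * T[m][m] for m in IDX)                  # eta_{ls} T^{ls}
    kT = [sum(k_dn[l] * T[l][n] for l in IDX) for n in IDX]     # k_l T^{ln}
    kkT = sum(k_dn[n] * kT[n] for n in IDX)                     # k_l k_s T^{ls}
    return [[T[r][n]
             + alpha * k_up[r] * k_up[n] * kkT / mu2**2
             + beta * eta(r, n) * kkT / mu2
             + gamma * k_up[r] * k_up[n] * trace / mu2
             + xi * eta(r, n) * trace
             + eps * (k_up[r] * kT[n] + k_up[n] * kT[r]) / mu2
             for n in IDX] for r in IDX]


# On-shell momentum for mu^2 = 1: k^2 = (5/4)^2 - (3/4)^2 = 1.
mu2 = F(1)
k_up = [F(5, 4), F(0), F(0), F(3, 4)]
T = [[1, 2, -3, 4],
     [2, 5, 6, -7],
     [-3, 6, 8, 9],
     [4, -7, 9, 10]]          # arbitrary symmetric test tensor

PT = project(T, k_up, mu2)
k_dn = [ETA[m] * k_up[m] for m in IDX]
divergence = [sum(k_dn[r] * PT[r][n] for r in IDX) for n in IDX]  # k_rho (PT)^{rho nu}
trace = sum(ETA[m] * PT[m][m] for m in IDX)                       # eta_{rho nu} (PT)^{rho nu}
print(divergence, trace)
```

Both the contraction with $k_\rho$ and the trace vanish for any symmetric input as long as $k^2 = \mu^2$; moving `k_up` off shell makes them nonzero, which is the sense in which the constraint is relaxed in the text that follows.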
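Our equation for $\phi^{\rho\nu}$ follows from (8.2.2):

$$(k^2 - \mu^2)\, \phi^{\rho\nu} = \lambda\left[T^{\rho\nu} + \frac{2}{3}\frac{k^\rho k^\nu k_\lambda k_\sigma}{\mu^4} T^{\lambda\sigma} + \frac{1}{3}\frac{k^\rho k^\nu}{\mu^2} T + \frac{1}{3}\eta^{\rho\nu}\frac{k_\lambda k_\sigma}{\mu^2} T^{\lambda\sigma} - \frac{1}{3}\eta^{\rho\nu} T - \frac{1}{\mu^2}\left(k^\rho k_\lambda T^{\lambda\nu} + k^\nu k_\lambda T^{\lambda\rho}\right)\right] \tag{8.2.11}$$

where the coupling constant $\lambda$ will be determined later, and $k^\mu$ is no
longer subject to the constraint $k^2 = \mu^2$.⁵

Equation (8.2.11) has the property that the modes $k^\rho\phi_{\rho\nu}$ and $\eta_{\rho\nu}\phi^{\rho\nu}$
will not be radiated; that is, the pole $1/(k^2 - \mu^2)$ in the solution of
(8.2.11) will be canceled for those modes.

We now turn (8.2.11) into an equation for $\phi^{\rho\nu}$; in its present form,
it is not simple to take the limit $\mu \to 0$. We follow the same procedure
as in the vector case. We express the objects that appear divided by $\mu^2$
in terms of $\phi^{\rho\nu}$. These objects are $T = \eta_{\rho\nu}T^{\rho\nu}$, $T^\nu = k_\rho T^{\rho\nu}$, and
$\tilde{T} = k_\nu T^\nu$. In terms of these, (8.2.11) is

$$(k^2 - \mu^2)\, \phi^{\rho\nu} = \lambda\left[T^{\rho\nu} + \frac{2}{3}\frac{k^\rho k^\nu}{\mu^4}\tilde{T} + \frac{1}{3}\frac{k^\rho k^\nu}{\mu^2} T + \frac{1}{3}\frac{\eta^{\rho\nu}}{\mu^2}\tilde{T} - \frac{1}{3}\eta^{\rho\nu} T - \frac{1}{\mu^2}\left(k^\rho T^\nu + k^\nu T^\rho\right)\right]. \tag{8.2.12}$$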


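Call $\eta_{\rho\nu}\phi^{\rho\nu} = \phi$, $k_\rho\phi^{\rho\nu} = \phi^\nu$, and $k_\rho k_\nu\phi^{\rho\nu} = \tilde{\phi}$. Now trace (8.2.12):

$$(k^2 - \mu^2)\,\phi = \lambda\left[T + \frac{2}{3}\frac{k^2}{\mu^4}\tilde{T} + \frac{1}{3}\frac{k^2}{\mu^2}T + \frac{4}{3}\frac{\tilde{T}}{\mu^2} - \frac{4}{3}T - \frac{2}{\mu^2}\tilde{T}\right]. \tag{8.2.13}$$

Next, multiply (8.2.12) by $k_\rho$:

$$(k^2 - \mu^2)\,\phi^\nu = \lambda\left[T^\nu + \frac{2}{3}\frac{k^2 k^\nu}{\mu^4}\tilde{T} + \frac{1}{3}\frac{k^2 k^\nu}{\mu^2}T + \frac{1}{3}\frac{k^\nu}{\mu^2}\tilde{T} - \frac{1}{3}k^\nu T - \frac{1}{\mu^2}\left(k^2 T^\nu + k^\nu\tilde{T}\right)\right] \tag{8.2.14}$$

and now multiply (8.2.14) by $k_\nu$:

$$(k^2 - \mu^2)\,\tilde{\phi} = \lambda\left[\tilde{T} + \frac{2}{3}\frac{k^4}{\mu^4}\tilde{T} + \frac{1}{3}\frac{k^4}{\mu^2}T + \frac{1}{3}\frac{k^2}{\mu^2}\tilde{T} - \frac{1}{3}k^2 T - \frac{2k^2}{\mu^2}\tilde{T}\right]. \tag{8.2.15}$$

Collecting terms in (8.2.13) and (8.2.15),

$$\frac{k^2 - \mu^2}{\lambda}\,\phi = T\left(1 + \frac{1}{3}\frac{k^2}{\mu^2} - \frac{4}{3}\right) + \tilde{T}\left(\frac{2}{3}\frac{k^2}{\mu^4} + \frac{4}{3\mu^2} - \frac{2}{\mu^2}\right) \tag{8.2.16}$$

and

$$\frac{k^2 - \mu^2}{\lambda}\,\tilde{\phi} = T\,\frac{k^2}{3}\left(\frac{k^2}{\mu^2} - 1\right) + \tilde{T}\left(1 + \frac{2}{3}\frac{k^4}{\mu^4} - \frac{5}{3}\frac{k^2}{\mu^2}\right) \tag{8.2.17}$$

so that

$$\lambda T = (3\mu^2 - 2k^2)\,\phi + 2\tilde{\phi}, \qquad \lambda\tilde{T} = \mu^2\left(k^2\phi - \tilde{\phi}\right). \tag{8.2.18}$$

Substituting back into (8.2.14), we learn that

$$\frac{\lambda T^\nu}{\mu^2} = k^\nu\phi - \phi^\nu \tag{8.2.19}$$

and finally putting $T$, $\tilde{T}$, and $T^\nu$ back into (8.2.12),

$$(k^2 - \mu^2)\,\phi^{\rho\nu} - k^\rho\phi^\nu - k^\nu\phi^\rho + k^\rho k^\nu\phi - \eta^{\rho\nu}\left[(k^2 - \mu^2)\,\phi - \tilde{\phi}\right] = \lambda T^{\rho\nu} \tag{8.2.20}$$

or in coordinate space, with $k_\mu = (1/i)\,\partial_\mu$,

$$(\partial_\lambda\partial^\lambda + \mu^2)\,\phi^{\rho\nu} - \partial^\rho\partial_\lambda\phi^{\lambda\nu} - \partial^\nu\partial_\lambda\phi^{\lambda\rho} + \partial^\rho\partial^\nu\phi - \eta^{\rho\nu}\left[(\partial_\lambda\partial^\lambda + \mu^2)\,\phi - \partial_\lambda\partial_\sigma\phi^{\lambda\sigma}\right] = -\lambda T^{\rho\nu}. \tag{8.2.21}$$


⁵We apologize for the use of $\lambda$ both as a coupling constant and a Lorentz tensorial
index. Unfortunately, we have run out of suitable Greek letters.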
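The chain of substitutions leading from (8.2.11) to (8.2.20) can be confirmed with exact arithmetic. The sketch below is an illustration under stated assumptions: the metric signature is taken as (+,−,−,−), and the momentum, mass, coupling, and source values are arbitrary test numbers. The helper names `field_from_source` and `wave_operator` are hypothetical. It builds $\phi^{\rho\nu}$ from a symmetric source via (8.2.11) at an off-shell momentum and then evaluates the left-hand side of (8.2.20).

```python
from fractions import Fraction as F

ETA = (1, -1, -1, -1)  # assumed signature (+,-,-,-)
IDX = range(4)


def eta(r, n):
    return ETA[r] if r == n else 0


def contractions(A, k):
    """Return (k^2, trace of A, k_l A^{l n}, k_l k_n A^{l n}) for a symmetric A."""
    kd = [ETA[m] * k[m] for m in IDX]
    k2 = sum(kd[m] * k[m] for m in IDX)
    tr = sum(ETA[m] * A[m][m] for m in IDX)
    vec = [sum(kd[l] * A[l][n] for l in IDX) for n in IDX]
    scal = sum(kd[n] * vec[n] for n in IDX)
    return k2, tr, vec, scal


def field_from_source(T, k, mu2, lam):
    """phi^{rho nu} obtained by dividing (8.2.11) by (k^2 - mu^2), k off shell."""
    k2, tr, Tv, Tt = contractions(T, k)
    return [[lam * (T[r][n]
                    + F(2, 3) * k[r] * k[n] * Tt / mu2**2
                    + F(1, 3) * k[r] * k[n] * tr / mu2
                    + F(1, 3) * eta(r, n) * Tt / mu2
                    - F(1, 3) * eta(r, n) * tr
                    - (k[r] * Tv[n] + k[n] * Tv[r]) / mu2) / (k2 - mu2)
             for n in IDX] for r in IDX]


def wave_operator(phi, k, mu2):
    """Left-hand side of (8.2.20)."""
    k2, tr, pv, pt = contractions(phi, k)
    return [[(k2 - mu2) * phi[r][n] - k[r] * pv[n] - k[n] * pv[r]
             + k[r] * k[n] * tr - eta(r, n) * ((k2 - mu2) * tr - pt)
             for n in IDX] for r in IDX]


k = [F(2), F(1, 2), F(-1), F(3)]       # arbitrary off-shell momentum
mu2, lam = F(3, 2), F(7, 5)            # arbitrary mass squared and coupling
T = [[1, 2, -3, 4], [2, 5, 6, -7], [-3, 6, 8, 9], [4, -7, 9, 10]]

phi = field_from_source(T, k, mu2, lam)
lhs = wave_operator(phi, k, mu2)
residual = [[lhs[r][n] - lam * T[r][n] for n in IDX] for r in IDX]
print(residual)
```

Every entry of `residual` is exactly zero, so the field constructed from (8.2.11) satisfies (8.2.20) for a general symmetric source, conserved or not.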


Equation (8.2.21) tells us the number of propagating modes of $\phi^{\rho\nu}$.
Of course, for $\mu \neq 0$ we know the number will be five by construction,
just as the number for a massive vector field was three. We first note that
whether or not $\mu$ is zero, (8.2.21) involves no second time derivatives of
the four quantities $\phi^{0\nu}$; hence, the equations for them are equations of
constraint. For $\mu \neq 0$, however, there is one more condition, obtained by
operating on (8.2.21) with $\partial_\rho$:

$$\mu^2\left(\partial_\rho\phi^{\rho\nu} - \partial^\nu\phi\right) = -\lambda\,\partial_\rho T^{\rho\nu}. \tag{8.2.22}$$

Thus, for $\nu = j$, we have constraint equations on $\partial_0\phi^{0j}$. However, for
$\nu = 0$, we find

$$\partial^0\left(\phi^{00} - \phi\right) = \partial^0\phi_{ii} = -\frac{\lambda}{\mu^2}\,\partial_\rho T^{\rho 0} - \partial_i\phi^{i0}, \tag{8.2.23}$$

eliminating one more propagating mode. We will see later that for $\mu = 0$
a different mechanism takes over.

In order to have a theory that treats consistently the generation of 
gravity by matter and the action of gravity on matter, we need a Lagran- 
gian that includes gravity, matter, and their interaction. We turn now to 
a Lagrangian formulation. 


8.3. LAGRANGIAN FOR THE GRAVITATIONAL FIELD 


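We write the Lagrangian density as

$$\mathcal{L}_g = \partial_\lambda\phi_{\rho\nu}\,\partial_{\lambda'}\phi_{\rho'\nu'}\left[a\,\eta^{\lambda\lambda'}\eta^{\rho\rho'}\eta^{\nu\nu'} + \cdots\right] + \mu^2\phi_{\rho\nu}\phi_{\rho'\nu'}\left[\alpha\,\eta^{\rho\rho'}\eta^{\nu\nu'} + \cdots\right] \tag{8.3.1}$$

where all possible pairings of the six and four indices must be included.

The number of independent pairings of the four indices associated
with the $\mu^2$ terms in (8.3.1) is two: $\phi_{\rho\nu}\phi^{\rho\nu}$ and $\phi_\rho{}^\rho\phi_\nu{}^\nu$. The total number
of pairings of the six indices in (8.3.1) is $5 \times 3 = 15$. Of these, only four
are independent: $\lambda$ paired with $\lambda'$ gives two (as in the $\mu^2$ terms above).
$\lambda$ and $\lambda'$ paired with $\rho\nu$ gives one; $\lambda$ paired with $\rho\nu$ and $\lambda'$ with $\rho'\nu'$ gives
the last one. All other pairings are reducible to these four by symmetry,
relabeling, or integration by parts.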
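The raw counts quoted above, before any reduction by symmetry, can be reproduced by brute-force enumeration. This is a small illustrative sketch; the function name `pairings` and the index labels are chosen here for convenience.

```python
def pairings(items):
    """Return every way of splitting `items` into unordered pairs."""
    items = list(items)
    if not items:
        return [[]]
    first, rest = items[0], items[1:]
    result = []
    for i, partner in enumerate(rest):
        remaining = rest[:i] + rest[i + 1:]
        for tail in pairings(remaining):
            result.append([(first, partner)] + tail)
    return result


six = pairings(["l", "r", "n", "l'", "r'", "n'"])   # indices of the derivative terms
four = pairings(["r", "n", "r'", "n'"])             # indices of the mass terms
print(len(six), len(four))
```

The enumeration gives 15 pairings of six indices and 3 pairings of four. The reduction to four and two independent structures then relies on the symmetry of $\phi_{\rho\nu}$, relabeling of primed and unprimed indices, and integration by parts, which this count does not attempt to automate.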

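Our Lagrange density therefore must take the form

$$\mathcal{L}_g = \frac{1}{2}\partial_\lambda\phi_{\rho\nu}\,\partial^\lambda\phi^{\rho\nu} + a\,\partial_\lambda\phi_{\rho\nu}\,\partial^\rho\phi^{\lambda\nu} + b\,\partial_\lambda\phi\,\partial^\lambda\phi + c\,\partial_\rho\phi\,\partial_\lambda\phi^{\lambda\rho} + \frac{\mu^2}{2}\left(\alpha\,\phi_{\rho\nu}\phi^{\rho\nu} + \beta\,\phi^2\right), \tag{8.3.2}$$

where we have chosen the factor $\tfrac{1}{2}$ to give the conventional positive
coefficient for the “natural” term, $(\dot{\phi}_{ij})^2/2$. Note that because $\phi^{\rho\nu} = \phi^{\nu\rho}$,
$\phi_\rho{}^\nu = \phi^\nu{}_\rho$, and can be written $\phi_\rho^\nu$ with no ambiguity. In order to work
out the Lagrange equations, it is convenient to keep track of indices by
writing (8.3.2) in the form suggested by (8.3.1). That is,

$$\mathcal{L}_g = \partial_\lambda\phi_{\rho\nu}\,\partial_{\lambda'}\phi_{\rho'\nu'} \times \left\{\frac{1}{2}\eta^{\lambda\lambda'}\eta^{\rho\rho'}\eta^{\nu\nu'} + a\,\eta^{\lambda\rho'}\eta^{\lambda'\rho}\eta^{\nu\nu'} + b\,\eta^{\lambda\lambda'}\eta^{\rho\nu}\eta^{\rho'\nu'} + c\,\eta^{\rho\nu}\eta^{\lambda'\rho'}\eta^{\lambda\nu'}\right\} + \frac{\mu^2}{2}\phi_{\rho\nu}\phi_{\rho'\nu'}\left(\alpha\,\eta^{\rho\rho'}\eta^{\nu\nu'} + \beta\,\eta^{\rho\nu}\eta^{\rho'\nu'}\right) \tag{8.3.3}$$

and

$$\frac{\partial\mathcal{L}_g}{\partial(\partial_\lambda\phi_{\rho\nu})} = \partial_{\lambda'}\phi_{\rho'\nu'}\left[\{\ \} + \{\ \}'\right] \tag{8.3.4}$$

where $\{\ \}$ is the first bracket in (8.3.3) and $\{\ \}'$ the same bracket with
primed and unprimed indices exchanged.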
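Similarly,

$$\frac{\partial\mathcal{L}_g}{\partial\phi_{\rho\nu}} = \frac{\mu^2}{2}\phi_{\rho'\nu'}\left[2\alpha\,\eta^{\rho\rho'}\eta^{\nu\nu'} + 2\beta\,\eta^{\rho\nu}\eta^{\rho'\nu'}\right]. \tag{8.3.5}$$

Thus, the Lagrange equations are

$$\partial_\lambda\partial_{\lambda'}\phi_{\rho'\nu'}\left[\eta^{\lambda\lambda'}\eta^{\rho\rho'}\eta^{\nu\nu'} + 2a\,\eta^{\lambda\rho'}\eta^{\lambda'\rho}\eta^{\nu\nu'} + 2b\,\eta^{\lambda\lambda'}\eta^{\rho\nu}\eta^{\rho'\nu'} + c\,\eta^{\rho\nu}\eta^{\lambda'\rho'}\eta^{\lambda\nu'} + c\,\eta^{\rho'\nu'}\eta^{\lambda\rho}\eta^{\lambda'\nu}\right] = \frac{\mu^2}{2}\phi_{\rho'\nu'}\left[2\alpha\,\eta^{\rho\rho'}\eta^{\nu\nu'} + 2\beta\,\eta^{\rho\nu}\eta^{\rho'\nu'}\right] \tag{8.3.6}$$

or

$$\partial_\lambda\left[\partial^\lambda\phi^{\rho\nu} + 2a\,\partial^\rho\phi^{\lambda\nu} + 2b\,\eta^{\rho\nu}\partial^\lambda\phi + c\left(\eta^{\rho\nu}\partial_\sigma\phi^{\sigma\lambda} + \eta^{\lambda\rho}\partial^\nu\phi\right)\right] = \mu^2\left[\alpha\,\phi^{\rho\nu} + \beta\,\eta^{\rho\nu}\phi\right]. \tag{8.3.7}$$

We must still symmetrize in $\rho\nu$, since the only permitted variation of
$\phi^{\rho\nu}$, $\delta\phi^{\rho\nu}$, must be symmetric in $\rho$ and $\nu$. The equation is then (with
$\phi = \phi_\lambda{}^\lambda$)

$$\partial_\lambda\partial^\lambda\phi^{\rho\nu} + a\left(\partial_\lambda\partial^\rho\phi^{\nu\lambda} + \partial_\lambda\partial^\nu\phi^{\rho\lambda}\right) + c\,\partial^\rho\partial^\nu\phi + \eta^{\rho\nu}\left(2b\,\partial_\lambda\partial^\lambda\phi + c\,\partial_\lambda\partial_\sigma\phi^{\lambda\sigma}\right) = \mu^2\left(\alpha\,\phi^{\rho\nu} + \beta\,\eta^{\rho\nu}\phi\right). \tag{8.3.8}$$

Comparing with (8.2.21), we find $a = -1$, $c = 1$, $b = -1/2$, $\alpha = -1$, and
$\beta = 1$, so $\mathcal{L}_g$ is as follows:

$$\mathcal{L}_g = \frac{1}{2}\left[\partial_\lambda\phi_{\rho\nu}\,\partial^\lambda\phi^{\rho\nu} - 2\,\partial_\lambda\phi_{\rho\nu}\,\partial^\rho\phi^{\lambda\nu} + 2\,\partial_\lambda\phi^{\lambda\rho}\,\partial_\rho\phi - \partial_\lambda\phi\,\partial^\lambda\phi - \mu^2\phi_{\rho\nu}\phi^{\rho\nu} + \mu^2\phi^2\right]. \tag{8.3.9}$$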


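Before continuing, we wish to verify that the energy of the free field
described by the Lagrangian density (8.3.9) is positive.
Note first that the second term in (8.3.9) may be rewritten as

$$\partial_\lambda\phi_{\rho\nu}\,\partial^\rho\phi^{\lambda\nu} = \partial^\rho\phi_{\rho\nu}\,\partial_\lambda\phi^{\lambda\nu} + \text{total derivatives}.$$

We may therefore calculate the energy from the canonical stress tensor,
dropping every term except

$$\mathcal{L} = \frac{1}{2}\left[\partial_\lambda\phi_{\rho\nu}\,\partial^\lambda\phi^{\rho\nu} - \mu^2\phi_{\rho\nu}\phi^{\rho\nu}\right] \tag{8.3.10}$$

since all the other terms are bilinear products of expressions, each of
which vanishes by the equations of motion. The energy is therefore

$$P^0 = \frac{1}{2}\int d\mathbf{x}\left[\partial_0\phi_{\rho\nu}\,\partial_0\phi^{\rho\nu} + \partial_i\phi_{\rho\nu}\,\partial_i\phi^{\rho\nu} + \mu^2\phi_{\rho\nu}\phi^{\rho\nu}\right]. \tag{8.3.11}$$

We proceed here as we did in (7.5.20) by Fourier expansion:

$$\phi_{\rho\nu} = \int d^3k\, \psi_{\rho\nu}\, e^{i(\mathbf{k}\cdot\mathbf{x} - \omega t)} + \text{c.c.}, \tag{8.3.12}$$

so that $P^0$ becomes

$$P^0 = (2\pi)^3\int d^3k\,\left(\mathbf{k}^2 + \mu^2 + k_0^2\right)\psi^*_{\rho\nu}\psi^{\rho\nu}. \tag{8.3.13}$$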
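$\psi_{\rho\nu}$ is restricted by the conditions

$$k^\rho\psi_{\rho\nu} = 0 \qquad \text{and} \qquad \psi_{\rho\nu}\eta^{\rho\nu} = 0. \tag{8.3.14}$$

Thus,

$$k^0\psi_{0\nu} + k^i\psi_{i\nu} = 0, \tag{8.3.15}$$

$$k^0\psi_{00} + k^i\psi_{i0} = 0, \tag{8.3.16}$$

and

$$k^0\psi_{0j} + k^i\psi_{ij} = 0, \tag{8.3.17}$$

so that

$$\psi_{00} = \frac{k^i k^j\psi_{ij}}{k_0^2} \tag{8.3.18}$$

and

$$\psi_{0i} = \psi_{i0} = -\frac{k^j\psi_{ji}}{k_0}. \tag{8.3.19}$$

We see that $\psi_{00}$ and $\psi_{0i}$ are not independent degrees of freedom, nor is
$\psi_{ii} = \psi_{00}$. With $\mathbf{k}$ in the three-direction, there remain as independent
components $\psi_{33}$, $\psi_{32}$, $\psi_{31}$, $\psi_{12}$, and $\psi_{11} - \psi_{22}$. The term $\psi_{11} + \psi_{22} =
\psi_{00} - \psi_{33}$ is already determined by $\psi_{33}$. Substituting (8.3.18) and (8.3.19)
in (8.3.13), we find for $\psi^*_{\rho\nu}\psi^{\rho\nu}$

$$\left|\frac{k^i k^j\psi_{ij}}{k_0^2}\right|^2 - 2\left|\frac{k^j\psi_{ji}}{k_0}\right|^2 + |\psi_{ij}|^2 \tag{8.3.20}$$

$$= |\psi_{33}|^2\epsilon^4 - 2\epsilon^2\sum_i|\psi_{3i}|^2 + |\psi_{ij}|^2$$

$$= |\psi_{33}|^2\left(\epsilon^4 - 2\epsilon^2 + 1\right) + 2\left(1 - \epsilon^2\right)\left(|\psi_{31}|^2 + |\psi_{32}|^2\right) + 2|\psi_{12}|^2 + |\psi_{11}|^2 + |\psi_{22}|^2 \tag{8.3.21}$$

which is positive-definite, since $\epsilon = |\mathbf{k}|/k_0 < 1$.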
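The step from (8.3.20) to (8.3.21) is a short piece of bookkeeping that is easy to check numerically. The sketch below assumes $\mathbf{k}$ along the three-axis (list index 2), uses (8.3.18) and (8.3.19) to eliminate the time components, and compares the direct contraction with the closed form. The function names and the random test amplitudes are illustrative only.

```python
import random


def contraction(psi, eps):
    """psi*_{rho nu} psi^{rho nu} with psi_00, psi_0i eliminated via (8.3.18)-(8.3.19)."""
    psi00 = eps**2 * psi[2][2]                       # k^i k^j psi_ij / k0^2
    psi0 = [-eps * psi[2][i] for i in range(3)]      # -k^j psi_ji / k0
    spatial = sum(abs(psi[i][j])**2 for i in range(3) for j in range(3))
    # One time index costs a minus sign, and psi_0i appears twice (0i and i0).
    return abs(psi00)**2 - 2 * sum(abs(p)**2 for p in psi0) + spatial


def closed_form(psi, eps):
    """Right-hand side of (8.3.21)."""
    return (abs(psi[2][2])**2 * (eps**4 - 2 * eps**2 + 1)
            + 2 * (1 - eps**2) * (abs(psi[2][0])**2 + abs(psi[2][1])**2)
            + 2 * abs(psi[0][1])**2 + abs(psi[0][0])**2 + abs(psi[1][1])**2)


def random_symmetric(rng):
    """Random complex symmetric 3x3 amplitude matrix."""
    m = [[0j] * 3 for _ in range(3)]
    for i in range(3):
        for j in range(i, 3):
            m[i][j] = m[j][i] = complex(rng.uniform(-1, 1), rng.uniform(-1, 1))
    return m


rng = random.Random(0)
psi = random_symmetric(rng)
eps = 0.8                                            # |k|/k0 < 1 for mu != 0
lhs, rhs = contraction(psi, eps), closed_form(psi, eps)
print(lhs, rhs)
```

The two numbers agree to rounding error and are positive for any `eps` below one. The trace condition of (8.3.14) is not imposed here; it only ties $\psi_{11} + \psi_{22}$ to $\psi_{33}$ and does not affect the sign of the result.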

It is interesting to consider the limit of this theory as $\mu^2 \to 0$ and
$(1 - \epsilon^2) = \mu^2/\omega^2 \to 0$. In order to make a nonzero contribution to the
energy, $|\psi_{31}|^2$ and $|\psi_{32}|^2$ must go like $1/\mu^2$, and $|\psi_{33}|^2$ like $1/\mu^4$ as $\mu^2 \to 0$.
We can find their actual behavior from (8.2.12). For the limit $\mu^2 \to 0$ to
exist at all requires a conserved source, so we take

$$k_\rho T^{\rho\nu} = 0. \tag{8.3.22}$$

The equation for $\psi^{\rho\nu}$ becomes

$$(k^2 - \mu^2)\,\psi^{\rho\nu} = \lambda\left[T^{\rho\nu} + \frac{1}{3}\left(\frac{k^\rho k^\nu}{\mu^2} - \eta^{\rho\nu}\right)T\right], \tag{8.3.23}$$

where $T = \eta_{\rho\nu}T^{\rho\nu}$, as before. We note the singularity of $\psi^{\rho\nu}$ as $\mu^2 \to 0$:
It is uniquely in the amplitude $\psi^{33}$, since $\mathbf{k}$ is in the three-direction, and
makes $|\psi^{33}|^2$ go like $1/\mu^4$ as $\mu^2 \to 0$. The mode

$$\psi^{11} + \psi^{22} = \psi^{00} - \psi^{33} = (\epsilon^2 - 1)\,\psi^{33} \to \text{constant} \tag{8.3.24}$$

as $\mu^2 \to 0$. The modes $\psi^{31}$ and $\psi^{32}$ are finite in the limit and therefore
carry no energy. The independent modes that survive and carry energy
are therefore $\psi^{11} - \psi^{22}$, $\psi^{12}$, and $\psi^{33}$. As we shall see later, the $\mu^2 = 0$
theory, based on (8.2.21) with $\mu^2 = 0$, has only two independent modes,
essentially $\psi^{11} - \psi^{22}$ and $\psi^{12}$. The limit of the nonzero $\mu^2$ theory as
$\mu^2 \to 0$ exists and is different from the $\mu^2 = 0$ theory.

We will come back later to a discussion of the free Lagrangian:
propagating modes, zero mass limit, stress tensor, etc. We wish first to
study the system gravitational field plus point particles.


8.4. PARTICLES IN A GRAVITATIONAL FIELD 


We write the action as the sum of three terms:

$$S = S_g + S_p + S_I \tag{8.4.1}$$

where

$$S_g = \int d^4x\, \mathcal{L}_g \tag{8.4.2}$$

and

$$S_p = -\sum_p \frac{m_p}{2}\int \frac{dy_p^\mu}{d\sigma_p}\frac{dy_p^\nu}{d\sigma_p}\,\eta_{\mu\nu}\, d\sigma_p, \tag{8.4.3}$$

as discussed in Section 7.2; for the interaction we take the simplest
invariant, nonderivative coupling:

$$S_I = -\lambda\sum_p m_p\int \frac{dy_p^\mu}{d\sigma_p}\frac{dy_p^\nu}{d\sigma_p}\,\phi_{\mu\nu}(y_p)\, d\sigma_p. \tag{8.4.4}$$

$m_p$ is inserted by hand to guarantee the equivalence principle, that is,
acceleration at a point independent of the particle. We could also add a
coupling to the trace $\phi = \eta^{\mu\nu}\phi_{\mu\nu}$, but that would be in some ways
equivalent to adding a scalar field and subject to the same experimental
problems.

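It is obviously simplifying to define

$$g_{\mu\nu}(y_p) = \eta_{\mu\nu} + 2\lambda\,\phi_{\mu\nu}(y_p). \tag{8.4.5}$$

The particle Lagrangian can then include the interaction by writing

$$S_p = -\sum_p \frac{m_p}{2}\int \frac{dy_p^\mu}{d\sigma_p}\frac{dy_p^\nu}{d\sigma_p}\, g_{\mu\nu}(y_p)\, d\sigma_p. \tag{8.4.6}$$

The equations of motion are

$$\frac{d}{d\sigma_p}\left(g_{\mu\nu}\frac{dy_p^\nu}{d\sigma_p}\right) = \frac{1}{2}\frac{dy_p^\lambda}{d\sigma_p}\frac{dy_p^\nu}{d\sigma_p}\,\partial_\mu g_{\lambda\nu} \tag{8.4.7}$$

or

$$g_{\mu\nu}\frac{d^2y_p^\nu}{d\sigma_p^2} = \frac{1}{2}\frac{dy_p^\lambda}{d\sigma_p}\frac{dy_p^\nu}{d\sigma_p}\,\partial_\mu g_{\lambda\nu} - \frac{dy_p^\lambda}{d\sigma_p}\frac{dy_p^\nu}{d\sigma_p}\,\partial_\lambda g_{\mu\nu} = -\frac{1}{2}\frac{dy_p^\lambda}{d\sigma_p}\frac{dy_p^\nu}{d\sigma_p}\left(\frac{\partial g_{\mu\lambda}}{\partial y^\nu} + \frac{\partial g_{\mu\nu}}{\partial y^\lambda} - \frac{\partial g_{\lambda\nu}}{\partial y^\mu}\right)$$

or

$$\frac{d^2y_p^\mu}{d\sigma_p^2} + \Gamma^\mu_{\lambda\nu}\frac{dy_p^\lambda}{d\sigma_p}\frac{dy_p^\nu}{d\sigma_p} = 0, \tag{8.4.8}$$

where $\Gamma$ is defined in (7.6.40).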

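We determine $d\sigma_p$ by calculating the $\sigma$ “energy” “$W$” from

$$L_p' = -\frac{m_p}{2}\frac{dy_p^\mu}{d\sigma_p}\frac{dy_p^\nu}{d\sigma_p}\, g_{\mu\nu}$$

as in (7.2.12). It is

$$\text{“}W\text{”} = \frac{dy_p^\mu}{d\sigma_p}\,\frac{\partial L_p'}{\partial\left(\dfrac{dy_p^\mu}{d\sigma_p}\right)} - L_p' = -\frac{m_p}{2}\frac{dy_p^\mu}{d\sigma_p}\frac{dy_p^\nu}{d\sigma_p}\, g_{\mu\nu} \tag{8.4.9}$$

so that $(d\sigma_p)^2 = \text{const.} \times dy_p^\mu\, dy_p^\nu\, g_{\mu\nu}(y_p)$. As in (7.2.11), we may choose
const. $= 1$ by adjusting $m_p$. Thus,

$$d\sigma_p = \left(dy_p^\mu\, dy_p^\nu\, g_{\mu\nu}(y_p)\right)^{1/2} \tag{8.4.10}$$

and in the equation of motion (8.4.8) $d\sigma_p$ is to be understood as given by
(8.4.10).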

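The field equations now follow. Once we have chosen the Lagrangian
to give the effect of the field on the particle, the particle’s effect as a
source of the field is determined. Then, the field equations follow from
$\delta S/\delta\phi_{\mu\nu}$ (with $\mu$, the gravitational field mass,⁶ set equal to zero). Thus,
with

$$S = \int \frac{1}{2}\left[\partial_\lambda\phi_{\mu\nu}\,\partial^\lambda\phi^{\mu\nu} - \partial_\lambda\phi_{\mu\nu}\,\partial^\mu\phi^{\lambda\nu} - \partial_\lambda\phi_{\mu\nu}\,\partial^\nu\phi^{\lambda\mu} + 2\,\partial_\lambda\phi^{\lambda\mu}\,\partial_\mu\phi - \partial_\lambda\phi\,\partial^\lambda\phi\right] d^4x - \sum_p \frac{m_p}{2}\int \frac{dy_p^\mu}{d\sigma_p}\frac{dy_p^\nu}{d\sigma_p}\, g_{\mu\nu}(y_p)\, d\sigma_p,$$

we have

$$-\frac{\delta S}{\delta\phi_{\mu\nu}} = \partial_\lambda\partial^\lambda\phi^{\mu\nu} - \partial^\mu\partial_\lambda\phi^{\lambda\nu} - \partial^\nu\partial_\lambda\phi^{\lambda\mu} + \partial^\mu\partial^\nu\phi - \eta^{\mu\nu}\left[\partial_\lambda\partial^\lambda\phi - \partial_\lambda\partial_\sigma\phi^{\lambda\sigma}\right] + \frac{\delta}{\delta\phi_{\mu\nu}}\sum_p \frac{m_p}{2}\int d\sigma_p\,\frac{dy_p^\lambda}{d\sigma_p}\frac{dy_p^\kappa}{d\sigma_p}\, g_{\lambda\kappa}(y_p) = 0. \tag{8.4.11}$$

The last term in (8.4.11) is transformed via

$$\int d\sigma\, \phi_{\mu\nu}(y_p) = \int d^4x\, \delta^4(x - y_p)\, \phi_{\mu\nu}(x)\, d\sigma$$

to

$$\lambda\sum_p m_p\int d\sigma_p\,\frac{dy_p^\mu}{d\sigma_p}\frac{dy_p^\nu}{d\sigma_p}\,\delta^4\left(x - y_p(\sigma_p)\right) \tag{8.4.12}$$

which we will call $\lambda\sum_p T_p^{\mu\nu}$, where $T_p^{\mu\nu}$ is the particle stress tensor:

$$T_p^{\mu\nu} = m_p\int \frac{dy_p^\mu}{d\sigma_p}\frac{dy_p^\nu}{d\sigma_p}\, d\sigma_p\, \delta^4\left(x - y_p(\sigma_p)\right) \tag{8.4.13}$$

with

$$d\sigma_p = \left(dy_p^\mu\, dy_p^\nu\, g_{\mu\nu}(y_p)\right)^{1/2}.$$

Equation (8.4.11) becomes

$$\partial_\lambda\partial^\lambda\phi^{\mu\nu} - \partial^\mu\partial_\lambda\phi^{\lambda\nu} - \partial^\nu\partial_\lambda\phi^{\lambda\mu} + \partial^\mu\partial^\nu\phi - \eta^{\mu\nu}\left(\partial_\lambda\partial^\lambda\phi - \partial_\lambda\partial_\sigma\phi^{\lambda\sigma}\right) = -\lambda T_p^{\mu\nu}. \tag{8.4.14}$$


⁶More precisely, in classical field theory, $1/\mu$ is the Compton wavelength of the field.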


Equation (8.4.14) has two remarkable properties. First, if we operate
with $\partial_\mu$ on the left-hand side, we get zero identically. Therefore, the
equation is inconsistent, since $\partial_\mu T_p^{\mu\nu}$ is not zero. It is not zero because
there is exchange of energy and momentum between matter and field, so
that the matter stress tensor alone cannot be conserved. However, if the
field is weak, then to lowest order, $T_p^{\mu\nu}$ is conserved. In fact, from (8.4.13),

$$\partial_\mu T_p^{\mu\nu} = -m_p\,\delta^3(\mathbf{x} - \mathbf{y}_p)\,\Gamma^\nu_{\lambda\kappa}(y_p)\,\frac{dy_p^\lambda}{d\sigma_p}\frac{dy_p^\kappa}{d\sigma_p}\frac{d\sigma_p}{dy_p^0} = -\Gamma^\nu_{\lambda\kappa}\,T_p^{\lambda\kappa}.$$

It is clear what program might be followed. Instead of $T_p^{\mu\nu}$ on the
right-hand side of (8.4.14), we should have $T^{\mu\nu}$, the total stress tensor of
our linear theory. This way of proceeding poses two difficult problems.
First, it is not clear how we should choose the $T^{\mu\nu}$ that is carried by the
gravitational field. We saw in Section 7.7 how to construct a conserved
tensor from a Lagrangian density. However, since the Lagrangian density
can be modified by adding a derivative (without changing the Lagrangian
equations and therefore without changing the theory), there is an infinite
set of possible $T^{\mu\nu}$’s, all conserved and symmetric. Second, if we modify
the right-hand side of (8.4.14), the new equation will conserve a different
stress tensor. To find the new stress tensor, we will need to find a Lagrangian
for the new equation, and so on, ad infinitum. This process can be
carried out, but we will not do so here.⁷ Remarkably, both problems are
solved exactly by Einstein’s general theory of relativity. We shall see how
this is done in Sections 8.7 and 8.8. For the moment, we will consider
(8.4.14) as an approximate equation and deduce its immediate consequences.
Therefore, imagine an extra term $\delta T^{\mu\nu}$ on the right-hand side
of (8.4.14); this extra term makes the equations consistent, but is small.
We shall need to find it later in order to calculate the precession of
planetary orbits.


⁷It is discussed in some detail in Feynman’s notes, previously cited in footnote 1.

The second remarkable property of (8.4.14) is that it no longer (because
$\mu = 0$) determines $\phi_{\mu\nu}$. It is, in fact, invariant under the transformation

$$\phi^{\mu\nu} \to \phi^{\mu\nu} + \partial^\mu\xi^\nu + \partial^\nu\xi^\mu \tag{8.4.15}$$

where the four $\xi^\mu$ are arbitrary functions of $x$.
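Both properties of the massless field equation, its identically vanishing divergence and its invariance under (8.4.15), can be verified in momentum space with exact arithmetic. The sketch below is an illustration under stated assumptions: the signature is taken as (+,−,−,−), each derivative is replaced by a factor of $k$, and the resulting overall sign and factors of $i$ (which do not affect either property) are dropped. The names `field_operator` and `gauge_shift` and the sample numbers are hypothetical.

```python
from fractions import Fraction as F

ETA = (1, -1, -1, -1)  # assumed signature (+,-,-,-)
IDX = range(4)


def eta(m, n):
    return ETA[m] if m == n else 0


def field_operator(phi, k):
    """Momentum-space left-hand side of (8.4.14), up to an overall factor of -1."""
    kd = [ETA[m] * k[m] for m in IDX]
    k2 = sum(kd[m] * k[m] for m in IDX)
    tr = sum(ETA[m] * phi[m][m] for m in IDX)                    # trace phi
    pv = [sum(kd[l] * phi[l][n] for l in IDX) for n in IDX]      # k_l phi^{l n}
    pt = sum(kd[n] * pv[n] for n in IDX)                         # k_l k_n phi^{l n}
    return [[k2 * phi[m][n] - k[m] * pv[n] - k[n] * pv[m]
             + k[m] * k[n] * tr - eta(m, n) * (k2 * tr - pt)
             for n in IDX] for m in IDX]


def gauge_shift(phi, xi, k):
    """phi^{mu nu} + k^mu xi^nu + k^nu xi^mu (the factor i is absorbed into xi)."""
    return [[phi[m][n] + k[m] * xi[n] + k[n] * xi[m] for n in IDX] for m in IDX]


k = [F(2), F(1, 2), F(-1), F(3)]          # arbitrary momentum
xi = [F(1), F(-2), F(3, 7), F(5)]         # arbitrary gauge function amplitude
phi = [[1, 2, -3, 4], [2, 5, 6, -7], [-3, 6, 8, 9], [4, -7, 9, 10]]

G = field_operator(phi, k)
G_shifted = field_operator(gauge_shift(phi, xi, k), k)
kd = [ETA[m] * k[m] for m in IDX]
divergence = [sum(kd[m] * G[m][n] for m in IDX) for n in IDX]
print(G == G_shifted, divergence)
```

The operator is unchanged by the shift and its contraction with $k_\mu$ vanishes for every field configuration, which is why the equation can only be consistent with a conserved source.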

The transformation (8.4.15) is analogous to the gauge transformation
of the potential $A_\mu$ in electrodynamics. There are two important differences,
however. The first is that the gravitational field $\Gamma^\mu_{\lambda\nu}$, which, according
to (8.4.8), determines the motion of a test particle is not invariant
under the transformation, unlike the electromagnetic case. The second is
that a coordinate transformation can eliminate the change in the gravitational
field. To see this, we observe that the action $S_p$ in (8.4.6) is
invariant under the gauge change (8.4.15) together with a transformation
to new coordinates⁸

$$x_\mu' = x_\mu - 2\lambda\,\xi_\mu(x),$$

all to first order in $\lambda\phi_{\mu\nu}$ and $\lambda\xi_\mu$.

The existence of this invariance provides us with a rigorous but difficult
way of dealing with the gauge problem: Formulate all experiments in
terms of gauge invariants. In practice, one does less: One usually assumes
that experiments are carried out in gravity-free regions, where the only
coordinate ambiguity is that associated with Lorentz transformations. That
is a not very subtle and not very satisfactory way of instructing the reader
not to worry too much about the choice of gauge.

We wish before proceeding to make sure that the weak field,
low-velocity limits of (8.4.8) and (8.4.14) yield Newtonian mechanics. We try
to solve (8.4.14) for a heavy, stationary source (like the sun), with mass
$M_\odot$.

A convenient gauge that decouples the tensor components of (8.4.14)
is defined by the condition

$$\partial_\mu\phi^{\mu\nu} = \frac{1}{2}\,\partial^\nu\phi. \tag{8.4.16}$$


⁸The alert reader will note here the first appearance of general covariance, albeit in
approximate form.
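This gauge is called harmonic. It is analogous to the Lorentz gauge in
electrodynamics.

To see that we can always go to this gauge, suppose $\partial_\mu\phi^{\mu\nu} - \tfrac{1}{2}\partial^\nu\phi =
\psi^\nu \neq 0$. Then we let $\phi^{\mu\nu} \to \phi^{\mu\nu} + (\partial^\mu\xi^\nu + \partial^\nu\xi^\mu)$ and demand

$$\partial_\mu\left(\partial^\mu\xi^\nu + \partial^\nu\xi^\mu\right) - \partial^\nu\partial_\mu\xi^\mu = -\psi^\nu \tag{8.4.17}$$

or

$$\partial_\mu\partial^\mu\xi^\nu = -\psi^\nu \tag{8.4.18}$$

which is an equation we can solve for $\xi^\nu$. Note that like the Lorentz gauge,
the harmonic gauge is really a class of gauges, for transformation within
which the gauge parameter $\xi^\mu$ satisfies the four-dimensional harmonic
equation $\partial_\lambda\partial^\lambda\xi^\mu = 0$. Hence, the name harmonic gauge.

If we insert (8.4.16) into (8.4.14), we find

$$\partial_\lambda\partial^\lambda\phi^{\mu\nu} - \frac{1}{2}\eta^{\mu\nu}\,\partial_\lambda\partial^\lambda\phi = -\lambda T_\odot^{\mu\nu} \tag{8.4.19}$$

where $T_\odot^{\mu\nu}$ is the stress tensor of the sun.
A further simplification results from the trace of (8.4.19):

$$\partial_\lambda\partial^\lambda\phi - \frac{4}{2}\,\partial_\lambda\partial^\lambda\phi = -\lambda T_\odot \tag{8.4.20}$$

with $T_\odot = T_{\odot\mu}{}^\mu$.
Therefore,

$$\partial_\lambda\partial^\lambda\phi = \lambda T_\odot \tag{8.4.21}$$

and

$$\partial_\lambda\partial^\lambda\phi^{\mu\nu} = -\lambda T_\odot^{\mu\nu} + \frac{1}{2}\eta^{\mu\nu}\,\partial_\lambda\partial^\lambda\phi = -\lambda\left(T_\odot^{\mu\nu} - \frac{1}{2}\eta^{\mu\nu}T_\odot\right). \tag{8.4.22}$$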


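For our source, $dy_\odot^i/d\sigma_\odot \approx 0$, and for weak fields and low velocities,
$d\sigma_\odot \approx dy_\odot^0$, yielding

$$T_\odot^{00} \approx M_\odot\,\delta^3(\mathbf{x} - \mathbf{y}_\odot); \tag{8.4.23}$$

all other components of $T_\odot^{\mu\nu}$ are approximately zero. Let us define $t^{\mu\nu}$
by the equation

$$T_\odot^{\mu\nu} - \frac{1}{2}\eta^{\mu\nu}T_\odot = \frac{M_\odot}{2}\,\delta^3(\mathbf{x} - \mathbf{y}_\odot)\,t^{\mu\nu}. \tag{8.4.24}$$

Then from (8.4.24), $t^{00} = t^{11} = t^{22} = t^{33} = 1$; all other components are
zero. The static solution of (8.4.22) is then the solution of

$$\nabla^2\phi^{\mu\nu} = \lambda\,\frac{M_\odot}{2}\,\delta^3(\mathbf{x} - \mathbf{y}_\odot)\,t^{\mu\nu}$$

or

$$\phi^{\mu\nu}(\mathbf{x}) = -\frac{\lambda}{4\pi}\,\frac{M_\odot}{2}\,\frac{t^{\mu\nu}}{|\mathbf{x} - \mathbf{y}_\odot|}. \tag{8.4.25}$$

We next substitute (8.4.25) into (8.4.8). We take the static field,
low-velocity limit of (8.4.8). That is,

$$\frac{d^2y_p^\mu}{d\sigma^2} + \Gamma^\mu_{00}\left(\frac{dy_p^0}{d\sigma}\right)^2 = 0. \tag{8.4.26}$$

The equation for $y_p^0$ is then

$$\frac{d^2y_p^0}{d\sigma^2} + \Gamma^0_{00}\left(\frac{dy_p^0}{d\sigma}\right)^2 = 0. \tag{8.4.27}$$

But for a static, diagonal $\phi^{\mu\nu}$, as ours is, $\Gamma^0_{00} = 0$. Thus, $(d^2y_p^0/d\sigma^2) = 0$,
and we may take $\sigma = y_p^0$.
The space components of (8.4.26) are therefore

$$\frac{d^2y_p^i}{d(y_p^0)^2} = -\Gamma^i_{00}. \tag{8.4.28}$$

In the weak field static approximation,

$$\Gamma^i_{00} = \frac{1}{2}\,\partial_i g_{00} = \lambda\,\partial_i\phi_{00}, \tag{8.4.29}$$

and

$$\frac{d^2y_p^i}{d(y_p^0)^2} = -\lambda\,\partial_i\phi_{00}$$

with

$$\phi_{00} = -\frac{\lambda}{8\pi}\,\frac{M_\odot}{|\mathbf{x} - \mathbf{y}_\odot|} \tag{8.4.30}$$

so

$$\frac{\lambda^2}{8\pi} = G, \tag{8.4.31}$$

Newton’s gravitational constant.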
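The Newtonian limit can be illustrated numerically. The sketch below is a minimal check under stated assumptions: units with $c = 1$, a source at the origin, and $g_{00} = 1 - 2GM/r$ as implied by (8.4.5), (8.4.25), and (8.4.31). It evaluates $-\Gamma^i_{00} = -\tfrac{1}{2}\partial_i g_{00}$ by central differences and compares it with the inverse-square acceleration. The function names and the sample point are illustrative.

```python
import math


def g00(x, y, z, gm):
    """Weak-field time-time metric component, g_00 = 1 - 2GM/r (units with c = 1)."""
    return 1.0 - 2.0 * gm / math.sqrt(x * x + y * y + z * z)


def acceleration(pos, gm, h=1e-5):
    """-Gamma^i_00 = -(1/2) d_i g_00, estimated by central differences."""
    acc = []
    for i in range(3):
        plus, minus = list(pos), list(pos)
        plus[i] += h
        minus[i] -= h
        acc.append(-0.5 * (g00(*plus, gm) - g00(*minus, gm)) / (2.0 * h))
    return acc


def newton(pos, gm):
    """Inverse-square acceleration -GM x^i / r^3."""
    r = math.sqrt(sum(c * c for c in pos))
    return [-gm * c / r**3 for c in pos]


pos, gm = (3.0, 4.0, 12.0), 1.0        # sample point with r = 13, GM = 1
print(acceleration(pos, gm), newton(pos, gm))
```

The two vectors agree to the accuracy of the finite difference, which is the content of the identification $\lambda^2/8\pi = G$.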


8.5. INTERACTION OF THE GRAVITATIONAL FIELD 


We consider a given gravitational field, $g_{\mu\nu} = \eta_{\mu\nu} + 2\lambda\,\phi_{\mu\nu}$. We have seen
in Section 8.4 the action of a point particle in the field $g_{\mu\nu}$:

$$S_p = -\frac{m_p}{2}\int d\sigma\, g_{\mu\nu}\,\frac{dy^\mu}{d\sigma}\frac{dy^\nu}{d\sigma} \tag{8.5.1}$$

with the resulting equations of motion [as in (8.4.8)]

$$\frac{d^2y^\mu}{d\sigma^2} + \Gamma^\mu_{\lambda\nu}\,\frac{dy^\lambda}{d\sigma}\frac{dy^\nu}{d\sigma} = 0. \tag{8.5.2}$$

We have left out the $p$ subscript in (8.5.2) since from now on we will be
considering only one particle at a time.

If we let $g_{\mu\nu}$ transform like a tensor under general coordinate
transformations (under which $dy^\mu/d\sigma$ transforms like a vector), the action (8.5.1)
will be exactly, rather than approximately, invariant, and the equations
of motion (8.5.2) covariant under such transformations. Further, the
arguments given in Section 7.7 that show that $g_{\mu\nu}$ can be transformed to
$g_{\mu\nu} = \eta_{\mu\nu}$ at a point, and $\Gamma^\mu_{\lambda\nu} = 0$ at that point, hold here as well. Therefore,
at each point there will be a coordinate system (the elevator system) in
which the equations of motion are

$$\frac{d^2y^\mu}{d\sigma^2} = 0 \tag{8.5.3}$$

accurate up to and including linear terms in the expansion of $g_{\mu\nu}$ about
$\xi = \xi_0$. At the center of this freely falling elevator system, masses move
under the effect of all other than gravitational forces: The gravitational
force has been eliminated. For this to hold over the entire elevator, the
elevator must be small enough so that the effect of the neglected quadratic
terms in the expansion of $g_{\mu\nu}$ about $\xi = \xi_0$ is small compared to that of
the other forces at work.

The principle that all effects of gravity should vanish in the elevator
system was called by Einstein the principle of equivalence of gravitation
and inertia.

At this point, we must ask about other interactions. In particular, it
does not seem possible that all particles move freely in the elevator frame
unless, in that frame, all the laws of nature take their gravitation-free
form.

This apparent extension of the equivalence principle is certainly necessary
to make the original limited principle consistent. For example, if it
were not true, the electromagnetic contribution to mass, which is different
for different atoms, would show up in the Eötvös experiment (and its
children and grandchildren), now accurate to one part in $10^{11}$. The action
(8.5.1) shows us how to accomplish this more general goal: We use the
gravitational tensor $g_{\mu\nu}$ to make the Lorentz-invariant action generally
invariant and the Lorentz-covariant equations generally covariant. Thus,
we take for the action of the electromagnetic field interacting with charged
particles, as in (7.8.7):


nape dy% 
S= -D S g, ZE do, + E | ap PEA, do, 
p 2 do, do, P do, 


= TS VeA, - 0,A,) g `g (OLA, —3,An). (8.5.4) 


In (8.5.4) we have taken the fundamental field $A_\mu$ to be a covariant vector
and the particle displacement $dy^\mu$ to be a contravariant vector. The fields
$F_{\mu\nu} = (\partial_\mu A_\nu - \partial_\nu A_\mu)$ are given by ordinary derivatives, since, as we have
seen, the ordinary curl is a tensor.

The equations for the electromagnetic field are then 


F”, =j” with 


4 — 
j*Qy) = a» | do Se VO We and from (7.6.43), 
Ve da, 


Op Fiat OF + OP xp =0. (8.5.5) 


The relativistic quantum wave equation for a charged scalar field y 
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with charge e would be the covariant extension of 
[ (a. — ieA„)(0” — ieA”) + m?] b= 0, (8.5.6) 


or 


[g"" (ð, — ieA,) b]., — ieA nag", — ieA,) b+ m°h=0. (8.5.7) 


We shall use this equation shortly to study the gravitational red shift. 

We take up now three consequences of the tensor theory developed 
so far. These are the bending of light, the precession of elliptic Keplerian 
orbits, and the gravitational red shift. 

We discuss the bending of light and the orbit precession together. In 
both cases, what is at issue is the orbit equation for an object in a given 
static gravitational field. We treat the light ray as a rapidly moving particle
($v \approx 1$).

We start from (8.4.7): 


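$$ \frac{d}{d\sigma}\left(g_{\mu\nu}\frac{dy^\nu}{d\sigma}\right) = \frac{1}{2}\,\partial_\mu g_{\lambda\sigma}\,\frac{dy^\lambda}{d\sigma}\frac{dy^\sigma}{d\sigma}. \tag{8.5.8} $$

We consider the case where

$$ g_{00} = g_0, \qquad g_{ij} = -g_s\,\delta_{ij}, \qquad g_{0i} = 0, \tag{8.5.9} $$

and $g_0$ and $g_s$ are functions of $|\mathbf{y}| = r$ alone. We found, from (8.4.25) and
(8.4.31),

$$ g_0 = 1 - \frac{2M_\odot G}{r} \quad\text{and}\quad g_s = 1 + \frac{2M_\odot G}{r}. \tag{8.5.10} $$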


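We consider here the more general case [in which (8.5.9) still holds],
where we have corrected the right-hand side of (8.4.14), as discussed
below in Section 8.8. Under these conditions, (8.5.8) has three integrals.
First,

$$ \frac{d}{d\sigma}\left(g_0\frac{dy^0}{d\sigma}\right) = 0, \tag{8.5.11} $$

so

$$ g_0\frac{dy^0}{d\sigma} = \text{constant} = W. \tag{8.5.12} $$

Second, from (8.4.10),

$$ g_{\mu\nu}\frac{dy^\mu}{d\sigma}\frac{dy^\nu}{d\sigma} = 1, \tag{8.5.13} $$

and third, from the space component vector equation

$$ \frac{d}{d\sigma}\left(g_s\frac{d\mathbf{y}}{d\sigma}\right) = -\frac{1}{2}\,\hat{\mathbf{y}}\left(\frac{d}{dy}g_{\mu\nu}\right)\frac{dy^\mu}{d\sigma}\frac{dy^\nu}{d\sigma}, $$

where $\hat{\mathbf{y}} = \mathbf{y}/y$ and $y = |\mathbf{y}|$, we derive angular momentum conservation:

$$ \mathbf{L} = g_s\,\mathbf{y}\times\frac{d\mathbf{y}}{d\sigma} = \text{constant}. \tag{8.5.14} $$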


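We learn from (8.5.14) that the motion stays in a plane; we choose
polar coordinates $r$ and $\theta$ in that plane. Then (8.5.13) becomes (with
$dA/d\sigma = \dot{A}$)

$$ g_0(\dot{y}^0)^2 - g_s(\dot{r}^2 + r^2\dot{\theta}^2) = 1, \tag{8.5.15} $$

(8.5.14) becomes

$$ g_s r^2\dot{\theta} = L, \tag{8.5.16} $$

and substituting $\dot{y}^0$ from (8.5.12) in (8.5.15) yields

$$ \frac{W^2}{g_0} - g_s(\dot{r}^2 + r^2\dot{\theta}^2) = 1 \tag{8.5.17} $$

or

$$ \dot{r}^2 + r^2\dot{\theta}^2 = \frac{1}{g_s}\left(\frac{W^2}{g_0} - 1\right). \tag{8.5.18} $$

We change to $\theta$ as an independent variable, using (8.5.16):

$$ \left[\left(\frac{dr}{d\theta}\right)^2 + r^2\right]\frac{L^2}{g_s^2 r^4} = \frac{1}{g_s}\left(\frac{W^2}{g_0} - 1\right). \tag{8.5.19} $$

The substitution $u = 1/r$ gives

$$ \left(\frac{du}{d\theta}\right)^2 + u^2 = \frac{g_s}{L^2}\left(\frac{W^2}{g_0} - 1\right). \tag{8.5.20} $$

The differential equation for the orbit is found by differentiating (8.5.20):

$$ \frac{d^2u}{d\theta^2} + u = \frac{1}{2}\frac{\partial}{\partial u}\left[\frac{g_s}{L^2}\left(\frac{W^2}{g_0} - 1\right)\right]. \tag{8.5.21} $$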


We see from (8.5.10) that expanding $g_s$ and $g_0$ in powers of $1/r = u$ is
equivalent to expanding in powers of $1/c^2$. Thus, since we are considering a
slowly moving body (a planet), we may expand

$$ \frac{g_s}{2L^2}\left(\frac{W^2}{g_0} - 1\right) = A + Bu + Cu^2 + \cdots. \tag{8.5.22} $$

Equation (8.5.21) becomes

$$ \frac{d^2u}{d\theta^2} + u(1 - 2C) = B \tag{8.5.23} $$

or

$$ u - \frac{B}{1 - 2C} = w\cos\left[(1 - 2C)^{1/2}(\theta - \theta_0)\right] $$

or for small C, with $\theta_0$ set to zero (choice of axis),

$$ u = B + w\cos[(1 - C)\theta] \tag{8.5.24} $$

for a precession angle $2\pi C$ per orbital year.
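The statement that (8.5.23) leads to a perihelion advance of about $2\pi C$ per revolution can be checked numerically. The following is a minimal sketch, not part of the original text: the function names and the numerical values of B, C, and w are illustrative assumptions, and the integrator is an ordinary fourth-order Runge-Kutta scheme chosen only for convenience.

```python
import math

def next_perihelion(B, C, w, h=1e-3):
    """Integrate u'' = B - (1 - 2C) u starting at a perihelion (u' = 0, u maximal)
    and return the angle theta at which the next perihelion occurs.
    B, C, w are the constants of (8.5.23)-(8.5.24); h is an assumed step size."""
    k = 1.0 - 2.0 * C

    def acc(u):
        return B - k * u

    theta, u, v = 0.0, B / k + w, 0.0  # start at the maximum of u = 1/r
    while theta < 4.0 * math.pi:
        # One classical RK4 step for the system u' = v, v' = acc(u).
        k1u, k1v = v, acc(u)
        k2u, k2v = v + 0.5 * h * k1v, acc(u + 0.5 * h * k1u)
        k3u, k3v = v + 0.5 * h * k2v, acc(u + 0.5 * h * k2u)
        k4u, k4v = v + h * k3v, acc(u + h * k3u)
        u_new = u + h * (k1u + 2 * k2u + 2 * k3u + k4u) / 6.0
        v_new = v + h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6.0
        if v > 0.0 and v_new <= 0.0:
            # u' changes sign from + to -, so u has a maximum: interpolate linearly.
            return theta + h * v / (v - v_new)
        theta, u, v = theta + h, u_new, v_new
    raise RuntimeError("no perihelion found in the integration range")

def precession_per_orbit(B, C, w):
    """Angle by which the perihelion advances in one revolution."""
    return next_perihelion(B, C, w) - 2.0 * math.pi
```

For small C the returned advance should agree with $2\pi C$ up to corrections of relative order C, which is the approximation made in passing from (8.5.23) to (8.5.24).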

We have seen earlier that our linear field equations are not 
consistent—the right-hand side is not conserved if the particle is accelerat- 
ing. When we learn how to deal with this problem, we will find corrections 
to $g_{\mu\nu}$ that contribute to C in (8.5.22). We will return to (8.5.24) when
we have that information [see (8.7.23)]. 

We turn now to the bending of light by the sun. We treat the light as
a fast particle with $v_0 \to 1$. In this calculation we need only keep the weak
field limit, that is, the linear term in the expansion of $g_s$ and $g_0$ in powers
of the potential $2M_\odot G/r$. The constants W and L that appear in the
equation can be calculated from their values far from the source
$M_\odot$: $L = bv_0/\sqrt{1 - v_0^2}$ and $W = 1/\sqrt{1 - v_0^2}$. Thus, the orbit equation for
$v_0 \to 1$ is [from (8.5.20)]

$$ \left(\frac{du}{d\theta}\right)^2 + u^2 = \frac{1}{b^2}\frac{g_s}{g_0} \approx \frac{1}{b^2}\left(1 + \frac{4M_\odot G}{r}\right) + O\!\left(\left(\frac{M_\odot G}{r}\right)^2\right) \tag{8.5.25} $$

and

$$ u'' + u = \frac{2M_\odot G}{b^2}, \tag{8.5.26} $$

where b is the impact parameter of the light ray passing the sun.
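The solution of (8.5.26) is

$$ u = \frac{2M_\odot G}{b^2} - \alpha\cos\theta, \tag{8.5.27} $$

where $\alpha$ is a constant. We have again chosen our axes to make $\theta_0 = 0$.
We determine the coefficient $\alpha$ by differentiating (8.5.27) with respect
to $\sigma$:

$$ -\frac{1}{r^2}\frac{dr}{d\sigma} = \alpha\sin\theta\,\frac{d\theta}{d\sigma}. \tag{8.5.28} $$

As $r \to \infty$, before the scattering,

$$ \frac{dr}{d\sigma} = -\frac{v_0}{\sqrt{1 - v_0^2}} \quad\text{and}\quad r^2\frac{d\theta}{d\sigma} = \pm\frac{bv_0}{\sqrt{1 - v_0^2}}, $$

the $\pm$ sign depending on the initial sign of $\dot{\theta}$, which without loss of
generality we take to be positive. The constant $\alpha$ is then given by
$\alpha = 1/(b\sin\theta_i)$, and (8.5.27) becomes

$$ \frac{1}{r} = \frac{2M_\odot G}{b^2} - \frac{\cos\theta}{b\sin\theta_i}, \tag{8.5.29} $$

where $\theta_i$ is the initial direction of $\theta$ from the scatterer, or with $\xi = r/b$

$$ \frac{1}{\xi} = \epsilon - \frac{\cos\theta}{\sin\theta_i}, \tag{8.5.30} $$

where $\epsilon = 2M_\odot G/b$ is a small dimensionless number.
For $\xi \to \infty$,

$$ 0 = \epsilon - \frac{\cos\theta}{\sin\theta_i}; \tag{8.5.31} $$

the two solutions of (8.5.31), $\theta = \pm\theta_i$, give $\theta$ before and after the
scattering. For $\epsilon = 0$, $\theta_i = \pm\pi/2$, so the particle comes in along the +y-axis,
goes out along the −y-axis, and is undeflected. The scattering angle is
therefore the difference between $2\theta_i$ and $\pi$. Since $\epsilon$ is small, the equation
for $\theta_i$ can be solved:

$$ 0 = \epsilon - \cot\theta_i $$

and with

$$ \theta_i = \frac{\pi}{2} - \delta \tag{8.5.32} $$

we have

$$ \epsilon - \delta = 0. \tag{8.5.33} $$

The scattered angle is therefore, without paying attention to sign,

$$ |\Delta\theta| = 2\epsilon = \frac{4M_\odot G}{b} $$

or in ordinary units

$$ |\Delta\theta| = \frac{4M_\odot G}{bc^2}. \tag{8.5.34} $$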
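As a numerical illustration of (8.5.31) through (8.5.34), the sketch below solves $0 = \epsilon - \cot\theta_i$ exactly and compares the result with the small-$\epsilon$ value $2\epsilon$, then evaluates (8.5.34) for a ray grazing the sun. The physical constants and the solar radius are standard reference values supplied here as assumptions; they do not appear in the text, and the function names are hypothetical.

```python
import math

# Assumed reference values (SI units), not taken from the text.
G = 6.674e-11        # gravitational constant
M_SUN = 1.989e30     # solar mass
R_SUN = 6.957e8      # solar radius, used as the impact parameter b
C_LIGHT = 2.998e8    # speed of light

def exact_deflection(eps):
    """Deflection |pi - 2*theta_i| with theta_i solving 0 = eps - cot(theta_i)."""
    theta_i = math.atan2(1.0, eps)   # cot(theta_i) = eps, theta_i in (0, pi/2]
    return math.pi - 2.0 * theta_i

def weak_field_deflection(mass, b):
    """Deflection angle 4 M G / (b c^2) of (8.5.34), in radians."""
    return 4.0 * G * mass / (b * C_LIGHT ** 2)

def radians_to_arcsec(angle):
    return math.degrees(angle) * 3600.0
```

With these assumed values, `radians_to_arcsec(weak_field_deflection(M_SUN, R_SUN))` comes out close to 1.75 seconds of arc, and `exact_deflection(eps)` differs from `2 * eps` only at order $\epsilon^3$.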


To find the nature of the trajectory (attraction or repulsion), we have to
trace out the orbit from (8.5.29). We write again

$$ \frac{1}{\xi} = \epsilon - \frac{\cos\theta}{\sin\theta_i}. \tag{8.5.35} $$

We start with $\theta_i \approx \pi/2$, $\dot{\theta} > 0$, $1/\xi = \epsilon - \cos\theta$ so that (see Figure 8.1)

$$ \epsilon - \cos\theta_i = 0, \qquad \theta_i = \frac{\pi}{2} - \epsilon. $$

Clearly, $\theta$ must increase from $\theta_i$ to keep $1/\xi > 0$. Thus, the trajectory
circles the origin, ending at $\theta_f = -\pi/2 + \epsilon$. The particle is attracted to the
scatterer.

Figure 8.1.

We may compare (8.5.31) with a naive (sometimes called Newtonian)
application of the equivalence principle, which would have a light wave
packet accelerate under gravity according to the local gravitational field.
Thus, for a small deflection, we can calculate the transverse change in
velocity $\Delta v$ from the unperturbed straight-line motion of the light. That
is,

$$ \Delta v = \int_{-\infty}^{\infty} dt\, a_\perp(t) $$

with

$$ a_\perp = M_\odot G\,\frac{r_\perp}{r^3}. $$

The result for impact parameter b is

$$ \frac{\Delta v}{c} = \frac{2M_\odot G}{c^2 b}, $$

half the correct result.
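The naive estimate can be reproduced by direct quadrature. The sketch below assumes the unperturbed path $x = ct$ at fixed transverse distance b, so that $r^2 = b^2 + c^2t^2$ and $a_\perp = GMb/r^3$; the substitution $t = (b/c)\tan\varphi$ and the midpoint rule are choices made here for illustration, and the function name is hypothetical.

```python
import math

def newtonian_delta_v(gm, b, c, n=10_000):
    """Midpoint-rule estimate of the transverse velocity change
    Delta v = integral of a_perp dt along the straight path x = c t.
    gm is G times the mass; b is the impact parameter; c is the speed of light."""
    total = 0.0
    dphi = math.pi / n
    for k in range(n):
        phi = -0.5 * math.pi + (k + 0.5) * dphi
        t = (b / c) * math.tan(phi)          # map (-pi/2, pi/2) onto all times
        r = math.hypot(b, c * t)             # distance from the source
        a_perp = gm * b / r ** 3             # transverse acceleration
        dt_dphi = (b / c) / math.cos(phi) ** 2
        total += a_perp * dt_dphi * dphi
    return total
```

Dividing the result by c gives $2GM/(c^2 b)$, which is one half of the deflection (8.5.34).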

We take up next the gravitational redshift. We note first that it follows 
directly and simply from the equivalence principle, as is shown in Problem 
8.1. 

A general argument for the effect is based on the tensor $g_{\mu\nu}$ itself.
Consider, in a static gravitational field, a stationary clock at point A with
an intrinsic period $\tau_A$ and a stationary clock at point B with an intrinsic
period $\tau_B$. Communication between them is by a fixed-frequency light
wave. This is possible because $g_{\mu\nu}$ is time-independent. Call the period
of the light wave T.

Now consider two events at A: two ticks of the stationary clock. The
light wave will resonate with the clock if T is the time (not the proper
time) between ticks of the clock, that is, if $T = \Delta y^0$, and

$$ \sqrt{g_{\mu\nu}\,\Delta y^\mu \Delta y^\nu} = \tau_A. $$

Since the clock is stationary, our resonance condition is

$$ T\sqrt{g_{00}(A)} = \tau_A. $$

Similarly, at B

$$ T\sqrt{g_{00}(B)} = \tau_B. $$

Therefore,

$$ \frac{\tau_B}{\tau_A} = \sqrt{\frac{g_{00}(B)}{g_{00}(A)}} $$

and the frequencies $\omega_B$ and $\omega_A$ have the ratio

$$ \frac{\omega_B}{\omega_A} = \sqrt{\frac{g_{00}(A)}{g_{00}(B)}}. $$

Thus, if A is deep in a gravitational potential and B is not, $\omega_B/\omega_A < 1$.
That is, the atom at A appears to be red-shifted.
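To get a feeling for the size of the effect, the ratio $\sqrt{g_{00}(A)/g_{00}(B)}$ can be evaluated with the weak-field form $g_{00} = 1 - 2GM/(rc^2)$. The sketch below takes A at the solar surface and B at the Earth's orbital distance, and ignores the Earth's own field; all numerical constants are standard reference values assumed here, not quantities given in the text.

```python
import math

# Assumed reference values (SI units), not taken from the text.
G = 6.674e-11
M_SUN = 1.989e30
R_SUN = 6.957e8          # position A: solar surface
EARTH_ORBIT = 1.496e11   # position B: Earth's distance from the sun
C_LIGHT = 2.998e8

def g00(mass, r):
    """Weak-field time-time metric component 1 + 2*phi, with phi = -G M / (r c^2)."""
    return 1.0 - 2.0 * G * mass / (r * C_LIGHT ** 2)

def frequency_ratio(g00_A, g00_B):
    """omega_B / omega_A = sqrt(g00(A) / g00(B))."""
    return math.sqrt(g00_A / g00_B)

def solar_redshift():
    """Fractional frequency shift 1 - omega_B/omega_A for the assumed positions."""
    return 1.0 - frequency_ratio(g00(M_SUN, R_SUN), g00(M_SUN, EARTH_ORBIT))
```

With these inputs the fractional shift is about two parts in a million, consistent with the estimate $GM_\odot/(R_\odot c^2)$.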

We can also give an explicit mechanism for this to happen. We assume
a $g_{\mu\nu}$ that is time-independent and imagine an atom in the sun and a
different atom on Earth exchanging a light signal of definite frequency
$\omega_0$. We then calculate the resonant frequency $\omega_0$ of the two atoms in
terms of the gravitational potentials at the two positions and the field-free
resonant frequencies $\omega_F$. We will be solving an atomic equation at each
site, so that the variation of the gravitational potential at each site will be
negligible and may be ignored. Thus, we imagine $g_{\mu\nu}$ to be constant at
each site, but different from one site to the other.

The atomic equation will be (for a scalar particle with charge e)

$$ [g^{\mu\nu}(\partial_\mu - ieA_\mu)(\partial_\nu - ieA_\nu) + m^2]\,\psi = 0, \tag{8.5.36} $$

where we have replaced covariant derivatives with ordinary ones, as
discussed above. The Klein-Gordon equation (8.5.36) with constant $g^{\mu\nu}$ and
time-independent $A_\mu$ will permit a time dependence

$$ \psi = \chi e^{-iEt}, \tag{8.5.37} $$

where $\chi$ is time-independent. The result is an eigenvalue equation for E;
the differences of E between two states of the same atom will be the
exchanged frequency $\omega_0$.


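Before writing the equation, we transform to a coordinate system in
which the three $g_{i0}$'s are zero. This can be done by shifting the time origin:

$$ x^0 = x'^0 + \Lambda(\mathbf{x}'), \qquad x^i = x'^i, \tag{8.5.38} $$

so that

$$ 0 = g'_{i0} = \frac{\partial x^\mu}{\partial x'^i}\frac{\partial x^\nu}{\partial x'^0}\,g_{\mu\nu} = g_{i0} + g_{00}\frac{\partial\Lambda}{\partial x'^i} \tag{8.5.39} $$

and

$$ \frac{\partial\Lambda}{\partial x'^i} = -\frac{g_{i0}}{g_{00}}. \tag{8.5.40} $$

This equation can be solved, always neglecting the variation of $g_{i0}/g_{00}$.
The solution is

$$ \Lambda = -\frac{g_{i0}}{g_{00}}\,x'^i. \tag{8.5.41} $$

Note also that (8.5.38) leaves $g_{00}$ unchanged.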
We now return to (8.5.36), with the insertion (8.5.37) and the space 
components of the vector potential equal to zero. The equation is 


[-g™(E + eA)’ + 0;8,87 + mlx = 0. (8.5.42) 


The vector potential A° will be given (in non-rationalized units) by 


—a,g/a;A° = 4rj? (8.5.43) 




where j° is the charge density of—as a model—a heavy proton: 


where —g; is the determinant of the three-dimensional matrix g;j: 
23S det Rij (8.5.45) 


Since both (8.5.42) and (8.5.43) are spatially covariant equations, we 
can transform them to the form 


[~E + eAyy - V7 + m? | vy=0 (8.5.46) 
and 
5°(x’) . 
Y2 A? = 47e, (8.5.47) 
: V 00 

so that 

Pee aid Agee Veo (8.5.48) 
V goor! l 


The eigenvalue equation is now 


2 
(-(ve" E 5 syak my =0 (8.5.49) 
z 


and $\sqrt{g^{00}}\,E = E_F$, the solution of the eigenvalue problem in the absence
of a gravitational field. Taking the difference of two states, we find, with
$\Delta E = \omega_0$,

$$ \sqrt{g^{00}}\,\omega_0 = \omega_F, \tag{8.5.50} $$

so $\sqrt{g_{00}}\,\omega_F = \omega_0$ for the two atoms that are exchanging the signal. Thus,
in the sun, where

$$ g_{00} = 1 + 2\phi, \tag{8.5.51} $$

with $\phi$ the gravitational potential, $g_{00} < 1$, whereas on Earth $g_{00}$ will be
much closer to 1. Therefore, the light emitted by the solar atom will have
a lower frequency $\omega_0$ than $\omega_F$, the frequency that would be observed for
the same atom in a field-free region. This calculation shows how general
covariance produces a mechanism for the gravitational red shift.


8.6. CURVATURE 


We are now ready to confront the inconsistency of the linear field theory 
with which we have been working. The clue is the equivalence principle. 
We have seen that the equivalence principle for particle motion or electro- 
magnetic field equations could be guaranteed by making the equations 
generally covariant with respect to the second-rank tensor $g_{\mu\nu}$. However,
for $g_{\mu\nu}$ to transform like a tensor, it must satisfy covariant equations. The
problem is therefore to construct a covariant function $H^{\mu\nu}$ of the g's
and their derivatives and set it equal to the assumed source, presumably
still the correctly calculated stress tensor for matter, $T^{\mu\nu}$. If so, the
equation must be something like 


2 
Hl, = a - .) = ATK. (8.6.1) 


The immediate problem is that the obvious way, introducing the covariant
derivatives $g_{\mu\nu;\lambda}$, does not work, since they are all zero.

There is, however, a tensor function of the g’s that we can construct. 
We take advantage of the covariant derivatives of a vector field V*: 


ve, =a,Ve +P" (8.6.2) 
and 
VP ea = av", Bx Tie Kop ~ Lis V” (8.6.3) 


so that the tensor V“.,., — V" aw is given by 
V” a S V” aw = dala VE TEV SIRO + rV?) 
= a,(a,V"*TK,V") Tiaa V7 TAV?) 
(eee Ole) AT T 
= R" ya V” (8.6.4) 


where


R” pa = AS, — OK, + PEL, — PETS (8.6.5)
Since $V^\lambda{}_{;\nu;\kappa}$ is a tensor and $V^\mu$ a vector, $R^\lambda{}_{\mu\nu\kappa}$ is a tensor. It is called the
curvature tensor. It has the following properties.

First, if the tensor $g_{\mu\nu}$ describes a space-time metric, and any component
of $R^\lambda{}_{\mu\nu\kappa}$ is different from zero in a finite region, then no coordinate
change can transform $g_{\mu\nu}$ to pseudo-Euclidean form. This is true since, if
it could, then because all the components of $R^\lambda{}_{\mu\nu\kappa}$ vanish in the
pseudo-Euclidean system and $R^\lambda{}_{\mu\nu\kappa}$ is a tensor, all the components of
$R^\lambda{}_{\mu\nu\kappa}$ would have had to vanish in the original coordinate system.

Second, the converse, which we shall not prove, also holds: If all
components of $R^\lambda{}_{\mu\nu\kappa}$ are zero in a finite region of space, then it is possible
to introduce a coordinate system that transforms $g_{\mu\nu}$ to pseudo-Euclidean
form in that entire region.

Third, the translation to the gravitational field $g_{\mu\nu}$ follows: If any
component of $R^\lambda{}_{\mu\nu\kappa}$ is different from zero in some region of space-time,
there is no coordinate transformation that eliminates the gravitational field
in that entire region. Conversely, if all the components of $R^\lambda{}_{\mu\nu\kappa}$ are zero,
one can find a pseudo-Euclidean coordinate system, in which $g_{\mu\nu} = \eta_{\mu\nu}$ in
the entire region, the gravitational field $\phi_{\mu\nu}$ is zero, and the
gravitation-free laws of special relativity hold. The curvature tensor
$R^\lambda{}_{\mu\nu\kappa}$ carries the invariant reality of the gravitational field.

In order to decide on the equations to be satisfied by $R^\lambda{}_{\mu\nu\kappa}$, we need
to know some of its general properties. These are most easily studied in
the local coordinate system that makes $g_{\mu\nu} = \eta_{\mu\nu}$ and $\partial g_{\mu\nu}/\partial x^\lambda = 0$. In
that coordinate system, at the chosen point,


A a (OVA mA ~À 
R uv T R are aa a, | we 0,1 HK 


j| 
> 3 NO (Ou Lar + Or — WB wr) — {Ko v} (8.6.6) 


The covariant curvature 


te T 
R yy E EaR HVK’ 


1 
R Sre = z OI Baw t OVO ux = DOR uv z Ipua) . (8.6.7) 


We can read off from (8.6.7) the symmetry properties of $R^{(0)}_{\lambda\mu\nu\kappa}$; since
$R_{\lambda\mu\nu\kappa}$ is a tensor, they will hold in general. These are:


1. $R_{\lambda\mu\nu\kappa}$ is antisymmetric in exchange of $\nu$ and $\kappa$.

2. $R_{\lambda\mu\nu\kappa}$ is antisymmetric in exchange of $\lambda$ and $\mu$.

3. $R_{\lambda\mu\nu\kappa}$ is symmetric in exchange of the pair $(\lambda\mu)$ with $(\nu\kappa)$.

4. There is one more algebraic relation:

$$ R_{\lambda\mu\nu\kappa} + R_{\lambda\nu\kappa\mu} + R_{\lambda\kappa\mu\nu} = 0. \tag{8.6.8} $$


Equation (8.6.8) only adds one constraint (in four dimensions) since 
if any two of the four indices are equal, the sum is identically zero by the 
symmetry properties 1-3. 

We can now calculate the number of independent components of 
$R_{\lambda\mu\nu\kappa}$ by considering $R_{\lambda\mu\nu\kappa}$ as a symmetric 6 × 6 matrix, six being the
number of values the antisymmetric pair $(\lambda\mu)$ and the pair $(\nu\kappa)$ can take.
This number of components is

$$ \frac{6 \times 7}{2} = 21. $$


Subtracting the single constraint (8.6.8), we get exactly the expected 
number, 20 (see the discussion in Section 7.7). 
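The counting argument generalizes to n dimensions, which gives a quick consistency check. The sketch below is an illustration only: it assumes, as in the argument above, that the cyclic relation (8.6.8) adds one new constraint for each set of four distinct index values, and it compares the result with the standard closed form $n^2(n^2 - 1)/12$. The function name is hypothetical.

```python
from math import comb

def riemann_components(n: int) -> int:
    """Independent components of the curvature tensor in n dimensions,
    counted as in the text: a symmetric matrix on antisymmetric index
    pairs, minus the constraints from the cyclic relation (8.6.8)."""
    pairs = comb(n, 2)                      # values of an antisymmetric pair
    symmetric = pairs * (pairs + 1) // 2    # symmetric pairs-by-pairs matrix
    cyclic = comb(n, 4)                     # one constraint per 4 distinct indices
    return symmetric - cyclic
```

For n = 4 this reproduces 21 − 1 = 20; for n = 2 and n = 3 the cyclic relation adds nothing and the counts are 1 and 6.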

There is in addition one more identity, but this time it is differential 
rather than algebraic, the Bianchi identity, 


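$$ R_{\lambda\mu\nu\kappa;\sigma} + (\text{cyclic permutations of } \nu\kappa\sigma) = 0 \tag{8.6.9} $$

or in the locally flat coordinate system

$$ \partial_\sigma R^{(0)}_{\lambda\mu\nu\kappa} + (\text{cyclic permutations of } \nu\kappa\sigma) = 0, \tag{8.6.10} $$

which is easily verified from (8.6.7).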


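Two more tensors can be constructed from $R_{\lambda\mu\nu\kappa}$ by tracing. Note
that the symmetry properties of $R_{\lambda\mu\nu\kappa}$ permit only one trace (to within a
sign):

$$ R_{\mu\kappa} = g^{\lambda\nu}R_{\lambda\mu\nu\kappa} \tag{8.6.11} $$

so that

$$ R^{(0)}_{\mu\kappa} = \eta^{\lambda\nu}R^{(0)}_{\lambda\mu\nu\kappa} = \frac{1}{2}\left(\partial_\mu\partial_\kappa g + \partial_\lambda\partial^\lambda g_{\mu\kappa} - \partial_\kappa\partial^\lambda g_{\lambda\mu} - \partial_\mu\partial^\lambda g_{\lambda\kappa}\right), \tag{8.6.12} $$

which is symmetric in $(\mu, \kappa)$; a second trace gives the curvature scalar

$$ R = g^{\mu\kappa}R_{\mu\kappa}, \qquad R^{(0)} = \eta^{\mu\kappa}R^{(0)}_{\mu\kappa} = \partial_\lambda\partial^\lambda g - \partial^\mu\partial^\kappa g_{\kappa\mu}. \tag{8.6.13} $$

Here, $g = \eta^{\mu\lambda}g_{\mu\lambda}$.


8.7. THE EINSTEIN FIELD EQUATIONS AND THE
PRECESSION OF ORBITS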


We recognize in (8.6.12) and (8.6.13) the two components of the equations
for $\phi_{\mu\nu}$ that we found in (8.2.21) (setting $\mu = 0$). That is, remembering
$g_{\mu\nu} = \eta_{\mu\nu} + 2\lambda\phi_{\mu\nu}$, we found

$$ R^{(0)}_{\mu\kappa} - \frac{1}{2}\eta_{\mu\kappa}R^{(0)} = -\lambda^2 T^{(0)}_{\mu\kappa}, \tag{8.7.1} $$

where $T^{(0)}_{\mu\kappa}$ is the matter stress tensor in the locally Minkowskian
coordinate system. As in Section 8.4, the left-hand side of (8.7.1) is identically
conserved:

$$ \partial^\mu\left(R^{(0)}_{\mu\kappa} - \frac{1}{2}\eta_{\mu\kappa}R^{(0)}\right) = 0. \tag{8.7.2} $$

Notice that now, however, (8.7.1) is consistent, because the right-hand
side $T^{(0)}_{\mu\kappa}$ is also conserved (the $\Gamma$'s are all zero!).

The covariant equation that follows from (8.7.1) in a general coordinate
system is

$$ R^{\mu\kappa} - \frac{1}{2}g^{\mu\kappa}R = -\lambda^2 T^{\mu\kappa}. \tag{8.7.3} $$

The conservation law (8.7.2) becomes

$$ \left(R^{\mu\kappa} - \frac{1}{2}g^{\mu\kappa}R\right)_{;\mu} = 0, \tag{8.7.4} $$

which, of course, is the reflection of the Bianchi identity on the properties
of the second-rank tensor $R^{\mu\kappa}$. However, the right-hand side of (8.7.3)
also has vanishing covariant divergence, since it is constructed from the
covariant matter (including electromagnetic) Lagrangian as discussed in
Section 7.8. The equations are therefore formally* consistent.

Note that in arriving at the field equation (8.7.3), the weak field
assumption has not been made, although it is clearly the case that the
weak field (small $\phi_{\mu\nu}$) limit of (8.7.3) reproduces (8.2.21); however, in
this limit, the right- and left-hand sides are no longer consistent with the
particle equations and must be supplemented by the higher-order terms
coming from the nonlinearities in $R_{\mu\nu}$ and $T_{\mu\nu}$. We turn now to that
problem, with the goal of finding the next approximation to the
gravitational potential produced by the sun, and in particular of finding the
correct value of the coefficient C in (8.5.22).

* They are only formally consistent since the same kind of point singularity that occurs
in other relativistic field theories occurs here.

We wish to expand (8.7.3) in powers of the potential $V = GM_\odot/r$,
which by the virial theorem is $\sim v^2$, and therefore small for $v/c \ll 1$. We
see from (8.5.21) and (8.5.12), since $W \sim 1$, that we need $g_s$ only to first
order in V, but $g_0$ to the next order. Thus, we must find the next
correction to $\phi_{00}$.

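We write

$$ R_{\mu\kappa} = R^{(1)}_{\mu\kappa} + R^{(2)}_{\mu\kappa} + \cdots, $$

where $R^{(1)}_{\mu\kappa}$ is linear in $\phi$, $R^{(2)}_{\mu\kappa}$ quadratic, etc. In turn, $g_{\mu\nu}$ is calculated as

$$ g_{\mu\nu} = g^{(0)}_{\mu\nu} + g^{(1)}_{\mu\nu} + \cdots, $$

where $g^{(0)}_{\mu\nu} = \eta_{\mu\nu}$, $g^{(1)}_{\mu\nu} = 2\lambda\phi^{(1)}_{\mu\nu}$, and

$$ \phi_{\mu\nu} = \phi^{(1)}_{\mu\nu} + \phi^{(2)}_{\mu\nu} + \cdots, \tag{8.7.5} $$

with $\phi^{(1)}_{\mu\nu}$ given by (8.4.27):

$$ \phi^{(1)}_{\mu\nu} = -\frac{\lambda M_\odot\,t_{\mu\nu}}{8\pi|\mathbf{x} - \mathbf{X}|}, \tag{8.7.6} $$

with $t_{00} = t_{11} = t_{22} = t_{33} = 1$, all other components zero. We expand by
expressing $R^{(1)}$ as a function of $\phi^{(2)}$ and $R^{(2)}$ as a function of the known
$\phi^{(1)}$; we then solve the resulting inhomogeneous equation for $\phi^{(2)}$.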

Before carrying out that procedure, we observe that (8.7.3) provides
us with an exactly conserved, symmetric tensor $\mathcal{T}^{\mu\nu}$, which is a nonlinear
function of the $\phi$ field and a suitable candidate for the stress tensor of the
coupled system. Since


(0) 
a) _ Ear pd) 2 7) 
Rv A R? = Gix 


is identically conserved [i.e., 6“G%) =0, where ð” is not the covariant 


derivative, but the ordinary derivative], we must have that 


MY pr 
a ae agi (re = ROE (SR = Tn) (8.7.7) 


is also exactly conserved. Further, since $T^{\mu\nu}$ includes contributions from
all nongravitational sources (point particles, electromagnetic field, etc.)
that reduce to known forms in the $\xi$ coordinate system, the conserved
tensor $\mathcal{T}^{\mu\nu}$ correctly describes the exchange of energy and momentum
between gravitational and other degrees of freedom. The tensor $\mathcal{T}^{\mu\nu}$
therefore makes it possible to calculate conventional energy, momentum,
and angular momentum fluxes of gravitational radiation. In the present
application, it makes it possible, by using (8.7.3), to calculate the next
approximation to $g_0$ [as defined in (8.5.9)] and from that and (8.5.22) the
precession of the planetary orbits.

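We choose a gauge (coordinate system) in which $\phi^{(2)}$ satisfies the
same linear gauge condition as $\phi^{(1)}$, that is,

$$ \partial^\mu\phi_{\mu\nu} = \frac{1}{2}\partial_\nu\phi, \tag{8.7.8} $$

where $\phi = \phi^\lambda{}_\lambda$ and all contractions are carried out with $\eta_{\mu\nu}$. Then

$$ R^{(1)}_{\mu\kappa} = \lambda\,\partial_\lambda\partial^\lambda\phi_{\mu\kappa} \tag{8.7.9} $$

and

$$ R^{(1)} = \lambda\,\partial_\lambda\partial^\lambda\phi. \tag{8.7.10} $$

Equation (8.7.3) is therefore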
v 1 v 2 2) (8 u R)” 3 
Al dð Pue T z eÀ o} =A TuT (Re = Sea) + 0(¢1) 


+ 0(¢2¢1). (8.7.11) 


[Note that (8.7.11) would become an exact equation if instead of R™, we 
wrote R — R” .] Proceeding systematically, we rewrite (8.7.3) as 


a) 
RERO RAER R E 


2 Hr 
where 
0) — 1) 
go = Nyrs a > 2c), etc. 


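We note that $R^{(1)}$ is zero outside the source to lowest order. Since
contributions inside the source will give corrections to the 1/r potential, which
causes no precession, they can be ignored. Then the equation we have to
solve is

$$ R^{(1)}_{\mu\kappa}(\phi^{(2)}) - \frac{1}{2}\eta_{\mu\kappa}R^{(1)}(\phi^{(2)}) + R^{(2)}_{\mu\kappa}(\phi^{(1)}) - \frac{1}{2}\eta_{\mu\kappa}R^{(2)}(\phi^{(1)}) = 0 \tag{8.7.13} $$

or

$$ R^{(1)}_{\mu\kappa}(\phi^{(2)}) = -R^{(2)}_{\mu\kappa}(\phi^{(1)}) \tag{8.7.14} $$

and, in the chosen gauge,

$$ \lambda\,\partial_\lambda\partial^\lambda\phi^{(2)}_{\mu\kappa} = -R^{(2)}_{\mu\kappa}(\phi^{(1)}). \tag{8.7.15} $$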


We first find a general formula for R® and then specialize to the case 
at hand: 


Rig Za Heer ere ary (8.7.16) 
and 
un l 
AAZ met 2 p` {ðn a Bor et dada Pon B dan Pue + dado Pun? 


on 7 a’ do 
2 4 


1 
= 5 anh dubon + (a, Outen t a, Pop 7 ðoPun) 


1 
+ go + AGL a pua iep + In Gd abon) (87.17) 


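In (8.7.17), $\phi$ stands for $\phi^{(1)}$, and all raising is done via $\eta^{\mu\nu}$. However,
note that with this convention

$$ g^{\mu\nu} = \eta^{\mu\nu} - 2\lambda\phi^{\mu\nu} \tag{8.7.18} $$

although

$$ g_{\mu\nu} = \eta_{\mu\nu} + 2\lambda\phi_{\mu\nu}, $$

since

$$ g_{\mu\lambda}g^{\lambda\nu} = \delta_\mu^\nu. $$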
We now specialize to the case at hand: We set u = 7 =0, 0, =0, = 


0 since we are looking for time-independent solutions. This leaves (note 
the gauge condition on $"”), 


R l; 1 2 
ne = 2 $” 3:9; Poo + rh gy - 9 dor) (Jab = abo). (8.7.19)
The first term in (8.7.19) will be proportional to $\nabla^2\phi_{00}$ and, hence,
confined to the source; it therefore does not contribute to the precession.
There remains

$$ R^{(2)}_{00} = -2\lambda^2\,\partial_i\phi_{00}\,\partial^i\phi_{00} = 2\lambda^2(\nabla\phi_{00})^2 \tag{8.7.20} $$

and (8.7.15) becomes

$$ -\lambda\nabla^2\phi^{(2)}_{00} = -2\lambda^2(\nabla\phi_{00})^2 $$

or

$$ \nabla^2\phi^{(2)}_{00} = \frac{\lambda^3 M_\odot^2}{32\pi^2 r^4} \tag{8.7.21} $$

and

$$ \phi^{(2)}_{00} = \frac{\lambda^3 M_\odot^2}{64\pi^2 r^2} + A + \frac{B}{r}, \tag{8.7.22} $$

where A and B are integration constants that do not affect the precession.
Our formula for $g_0$ is then

$$ g_0 = 1 - \frac{2\lambda^2 M_\odot}{8\pi r} + \frac{\lambda^4 M_\odot^2}{32\pi^2 r^2} = 1 + 2V + 2V^2, \tag{8.7.23} $$

where

$$ V = -\frac{\lambda^2 M_\odot}{8\pi r} = -\frac{GM_\odot}{r} $$

is the nonrelativistic potential of the sun.
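We now return to (8.5.20) for the orbit equation. Setting $W \approx 1 - \epsilon$,
we find

$$ \left(\frac{du}{d\theta}\right)^2 + u^2 = \frac{1}{L^2}(1 - 2V)\left(\frac{(1 - \epsilon)^2}{1 + 2V + 2V^2} - 1\right) \approx \frac{1}{L^2}\left(-2\epsilon - 2V + 6V^2\right). \tag{8.7.24} $$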
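The coefficient 6 of $V^2$ in (8.7.24) is the number that fixes the precession, so it is worth checking independently. The sketch below evaluates the unexpanded right-hand side (times $L^2$) with $g_s = 1 - 2V$ and $g_0 = 1 + 2V + 2V^2$, and estimates the $V^2$ coefficient by a central second difference at V = 0. The function names and the step size are assumptions made for this illustration.

```python
def orbit_rhs(eps, V):
    """L^2 times the right-hand side of (8.5.20), using g_s = 1 - 2V,
    g_0 = 1 + 2V + 2V^2 and W = 1 - eps."""
    return (1.0 - 2.0 * V) * ((1.0 - eps) ** 2 / (1.0 + 2.0 * V + 2.0 * V * V) - 1.0)

def quadratic_coefficient(eps=0.0, h=1e-3):
    """Coefficient of V^2, estimated from a central second difference at V = 0."""
    second = (orbit_rhs(eps, h) - 2.0 * orbit_rhs(eps, 0.0) + orbit_rhs(eps, -h)) / h ** 2
    return 0.5 * second

def linear_coefficient(eps=0.0, h=1e-3):
    """Coefficient of V, estimated from a central first difference at V = 0."""
    return (orbit_rhs(eps, h) - orbit_rhs(eps, -h)) / (2.0 * h)
```

At $\epsilon = 0$ the two estimates come out close to 6 and −2. Terms of order $\epsilon V$ and $\epsilon^2$ are dropped in (8.7.24); they shift only the linear and constant terms, not the $V^2$ coefficient to the order considered.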


The nonrelativistic equation would be

$$ \left(\frac{du}{d\theta}\right)^2 + u^2 = \frac{2(E - V)}{L^2}, $$

where E is the total energy, $E = (mv^2/2) + V$. Therefore, the equivalent
potential for our problem is $V_{\text{eff}} = V - 3V^2$, and the precession is given
by the solution

$$ u = B + w\cos\left[\left(1 - 3\left(\frac{GM_\odot}{L}\right)^2\right)\theta\right], \tag{8.7.25} $$

which returns to the same value of u at

$$ \theta = 2\pi\left(1 + 3\left(\frac{GM_\odot}{L}\right)^2\right) \tag{8.7.26} $$

for a forward precession of $6\pi(GM_\odot/L)^2$ per revolution.

Note that $L \sim rv$ and $GM_\odot/r \sim v^2$, so that the precession angle is of
order $(v/c)^2$, equivalent to the scale of fine structure in atomic physics.
(See Problem 8.8.)
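To put a number on the result, the precession $6\pi(GM_\odot/L)^2$ can be evaluated for Mercury. The sketch below restores factors of c and uses the Newtonian Kepler relation $L^2 = GM_\odot a(1 - e^2)$ for the angular momentum per unit mass; that relation, the orbital elements, and the constants are standard reference values assumed here rather than quantities given in the text, and the function names are hypothetical.

```python
import math

# Assumed reference values (SI units), not taken from the text.
GM_SUN = 1.32712e20        # G times the solar mass
C_LIGHT = 2.99792458e8     # speed of light
MERCURY_A = 5.7909e10      # semi-major axis
MERCURY_E = 0.2056         # eccentricity
MERCURY_PERIOD_DAYS = 87.969

def precession_per_orbit(gm, a, e, c=C_LIGHT):
    """6*pi*(G M / (L c))^2 with L^2 = G M a (1 - e^2), in radians per revolution."""
    L_squared = gm * a * (1.0 - e * e)
    return 6.0 * math.pi * (gm / c) ** 2 / L_squared

def arcsec_per_century(gm, a, e, period_days):
    """Accumulated precession over 36525 days, in seconds of arc."""
    orbits = 36525.0 / period_days
    return math.degrees(precession_per_orbit(gm, a, e) * orbits) * 3600.0
```

With these inputs, `arcsec_per_century(GM_SUN, MERCURY_A, MERCURY_E, MERCURY_PERIOD_DAYS)` gives roughly 43 seconds of arc per century.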

Note also that the second-order contribution to $g_0$ was $\sim 2V^2$ out of
a final $6V^2$. However, that division is not gauge-invariant; transformations
of the form $r' = f(r)$ can shift contributions between different orders, since
both first- and second-order potentials give precession of order $(v/c)^2$.

We close this section by constructing an action integral for (8.7.3). 
The Lagrangian density for the field $g_{\mu\nu}$ must be an invariant. The simplest
guess is, of course, R itself. We therefore try the action 


Sg _ f axve R, (8.7.27) 
Q 


where a is to be determined from the known equation of motion, with 


dX p 


dop 


(8.7.28) 


B 
Smarter F » Mp | doy ae Epp) 
P 2 Op 

We can verify our guess most simply by transforming, at a given
point in space, to the coordinate system that is pseudo-Euclidean in the
immediate neighborhood of that point. It is easily seen that $\sqrt{g}\,R$ differs
by a derivative (or equivalently by an integration by parts) from a function
of the $g_{\mu\nu}$'s which is bilinear in the first derivatives of $g_{\mu\nu}$'s. Therefore,
at the chosen point, we may set $g_{\mu\nu} = \eta_{\mu\nu}$ in the Lagrangian, since the
variation in $g_{\mu\nu}$ will be multiplied by derivatives of the $g_{\mu\nu}$'s that
themselves vanish. We may not set the derivatives equal to zero in the
Lagrangian, since this variation will lead to second derivatives, which do not
vanish, in the equations of motion. However, since the local equations in
this coordinate system are given by the weak field equations (8.2.21) (with
$\mu^2 = 0$), the Lagrangian must be given by the weak field Lagrangian,
(8.3.9). One verifies easily that the equality holds with 1/a = 16πG. Note
that the sign of the curvature scalar is an odd function of the sign of $g_{\mu\nu}$.
Therefore, choosing $g_{\mu\nu}$ to have positive space components would result
in a negative sign for a.


8.8. GRAVITATIONAL RADIATION 


The equation satisfied by the gravitational potential $\phi^{\mu\nu}$ is given by
(8.4.14). In the harmonic gauge, it is given by (8.4.22): 


aað ph” = ASH” (8.8.1) 


where 


SUP = ier ee (8.8.2) 


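and $T^{\mu\nu}$ is supplemented, as in (8.7.7), to make it exactly conserved:
$\partial_\nu T^{\mu\nu} = 0$; this leads to the harmonic gauge condition on $S^{\mu\nu}$:

$$ \partial_\nu S^{\mu\nu} = \frac{1}{2}\partial^\mu S, \tag{8.8.3} $$

where

$$ S = S^\lambda{}_\lambda = \eta_{\lambda\sigma}S^{\lambda\sigma}. \tag{8.8.4} $$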


As usual, we choose the retarded solution, to which we return shortly. 
First, we discuss the radiation itself. 
The propagation equation (for finite $\mu$, which we consider first) is


(0,0* + Ww) bor = OO, Pae a ad, bry + 9,9, 


-= no((,d* + w?) ~ 079% gap) =0. (8.8.5) 


By applying the operator ð” to the left-hand side of (8.8.5), we find the 
constraint equation 


H(A dye << se) =0 


or, since $\mu^2 \neq 0$,
por = 8r. (8.8.6) 


Equation (8.8.5) can therefore be rewritten as
(ana + B?) Qpr om: pð P = Noru$ A 0. (8.8.7)
The trace of (8.8.7) gives 


wo=0 o =0, (8.8.8) 


and, thus, from (8.8.6) 
aop =O. (8.8.9) 
Equations (8.8.8) and (8.8.9) were the starting point of our investigation 


of tensor fields, so it should not be a surprise that we have recovered 
them. The propagation equations are thus 


(aaa + uw) dy, = 0 (8.8.10) 


which with the five constraint equations leaves five propagating modes; 
one sees from the Fourier transform constraint equation (with $k_0 = \omega$ and
$k_1 = k_2 = 0$)


woo + khz =O, (8.8.11) 
who + kbs; = 0, (8.8.12) 

and 
gon — $33 — Pir — $22 = 0, (8.8.13) 


that one can conveniently take as dynamical variables $\phi_{12}$, $\phi_{13}$, $\phi_{23}$, $\phi_{33}$,
and $\phi_{11} - \phi_{22}$. The other components can then be found from (8.8.11),
(8.8.12), and (8.8.13). 

We turn next to the gravitational field, with $\mu = 0$. We note first that
the four constraint equations (8.8.6) cannot be derived and, in fact, will
not usually hold. There are, however, four constrained variables. We see
directly from (8.8.5) that no second time derivatives of $\phi_{00}$ and $\phi_{0i}$ occur,
so that $\phi_{00}$ and $\phi_{0i}$ are constrained variables. The second derivatives that
do occur are of the six linearly independent components $\phi_{12}$, $\phi_{13}$, $\phi_{23}$,
$\phi_{11} + \phi_{22}$, $\phi_{11} + \phi_{33}$, and $\phi_{22} + \phi_{33}$. We still have the four freedoms of
gauge choice, so that we can eliminate four more components, leaving us 
with two propagating modes. We now proceed to carry out the reduction. 
The wave equation (8.8.5) in wave number space is 


kK byy hi KK Pay a K*k Pay + kk Ny Ak*p z k*k? bap) =0. 
(8.8.14)
Taking the trace of (8.8.14) shows that the $\eta_{\mu\nu}$ term can be dropped,
leaving


khur kK ky av kbar t kuke = 0. (8.8.15) 


Consider first $k^2 \neq 0$. Then


Pav = k` Ek baat kèk, B- EEE g (8.8.16) 
and the gauge transformation 
Qur Quv t kur + Roa (8.8.17) 
with 
g=- Ke _ Lhd (8.8.18) 


reduces every component to zero. 
Therefore, only $k^2 = 0$ modes propagate. They, however, must be in
the harmonic gauge, as we now show! Since $k^2 = 0$, (8.8.15) becomes


kaput kb = kakio (8.8.19) 


where u = kK dua- 
We solve (8.8.19) by letting 


y, = Eg +e, (8.8.20) 
so that 
kuEr + kep =O. (8.8.21) 


Clearly, the only solution of (8.8.21) is $\epsilon_\mu = 0$. For example, go to a
Lorentz system where no component of $k_\mu$ is zero. Then $k_\mu\epsilon_\mu$ (no sum)
must be zero, as must $\epsilon_\mu$; since $\epsilon_\mu$ is a four-vector, it is zero in general.
So, 


| ie core (8.8.22) 


which is the harmonic gauge. 
Within the harmonic gauge, we can eliminate the four $\phi_{0\mu}$'s by the
gauge transformation


Pon Pou + Eko F fok (8.8.23) 
with 
_ _ $00 
Éo 2ko 
and 
1 rad 
pe Wa ties 8.8.24 
s A MST i ( ) 


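This leaves (8.8.22) as a further constraint:

$$ k^\mu\phi_{\mu\nu} = \frac{k_\nu}{2}\phi. \tag{8.8.25} $$

First, take $\nu = 0$. This gives $k^\mu\phi_{\mu 0} = k_0\phi/2$, so that $\phi = 0$ and, thus,

$$ k^\mu\phi_{\mu i} = k^j\phi_{ji} = \frac{k_i}{2}\phi = 0. $$

From $k^j\phi_{ji} = 0$, we see that for i = 3, $\phi_{33} = 0$. Since $\phi$ and $\phi_{00}$ are also
zero, so is $\phi_{11} + \phi_{22}$. We are thus left with the two propagating
components $\phi_{11} - \phi_{22}$ and $\phi_{12}$.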

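We can understand these degrees of freedom by studying their
transformation properties under a rotation about the z-axis:

$$ x = x'\cos\theta - y'\sin\theta, \qquad y = x'\sin\theta + y'\cos\theta, \tag{8.8.26} $$

so

$$ \frac{\partial x}{\partial x'} = \cos\theta, \quad \frac{\partial x}{\partial y'} = -\sin\theta, \quad \frac{\partial y}{\partial x'} = \sin\theta, \quad\text{and}\quad \frac{\partial y}{\partial y'} = \cos\theta. $$

Thus, since $\phi_{\mu\nu}$ is a tensor,

$$ \phi'_{ij} = \frac{\partial x^k}{\partial x'^i}\frac{\partial x^l}{\partial x'^j}\,\phi_{kl} \tag{8.8.27} $$

and

$$ \phi'_{11} = \cos^2\theta\,\phi_{11} + \sin^2\theta\,\phi_{22} + 2\sin\theta\cos\theta\,\phi_{12}, \tag{8.8.28} $$

$$ \phi'_{22} = \cos^2\theta\,\phi_{22} + \sin^2\theta\,\phi_{11} - 2\sin\theta\cos\theta\,\phi_{12}, \tag{8.8.29} $$

and

$$ \phi'_{12} = -\cos\theta\sin\theta\,\phi_{11} + \cos\theta\sin\theta\,\phi_{22} + (\cos^2\theta - \sin^2\theta)\,\phi_{12}, \tag{8.8.30} $$

so that

$$ \frac{\phi'_{11} - \phi'_{22}}{2} = \cos 2\theta\,\frac{\phi_{11} - \phi_{22}}{2} + \sin 2\theta\,\phi_{12} $$

and

$$ \phi'_{12} = \cos 2\theta\,\phi_{12} - \sin 2\theta\,\frac{\phi_{11} - \phi_{22}}{2}. \tag{8.8.31} $$

Evidently, the linear combinations

$$ \frac{\phi_{11} - \phi_{22}}{2} \pm i\phi_{12} $$

transform by a phase $2\theta$:

$$ \left(\frac{\phi_{11} - \phi_{22}}{2} \pm i\phi_{12}\right)' = e^{\mp 2i\theta}\left(\frac{\phi_{11} - \phi_{22}}{2} \pm i\phi_{12}\right). \tag{8.8.32} $$

These components are said to have helicity ±2 (h = ±2), respectively.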
We note that the helicity should be proportional to the projection of the 
wave angular momentum in the direction of propagation. The normaliz- 
ation for a classical plane wave would be 


J.w 
h = =—. 8.8.33 
a ( ) 


This is analogous to the case of the electromagnetic field that we 
studied in Section 3.8. There we found $h = \pm 1$ for the propagating modes,
and we explicitly demonstrated the relationship (8.8.33) for a suitable 
wave packet. 
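The helicity ±2 behavior can be verified numerically from the component transformation laws (8.8.28) through (8.8.30). The following sketch, with hypothetical function names and arbitrary test values, rotates the three transverse components and compares the rotated combination $(\phi_{11} - \phi_{22})/2 \pm i\phi_{12}$ with the original multiplied by a phase of magnitude $2\theta$. The sign of the exponent follows from the rotation convention of (8.8.26).

```python
import cmath
import math

def rotate_tensor(phi11, phi22, phi12, theta):
    """Transverse components after the rotation (8.8.26), per (8.8.28)-(8.8.30)."""
    c, s = math.cos(theta), math.sin(theta)
    p11 = c * c * phi11 + s * s * phi22 + 2.0 * s * c * phi12
    p22 = c * c * phi22 + s * s * phi11 - 2.0 * s * c * phi12
    p12 = -c * s * phi11 + c * s * phi22 + (c * c - s * s) * phi12
    return p11, p22, p12

def helicity_combination(phi11, phi22, phi12, sign):
    """(phi11 - phi22)/2 + sign * i * phi12, with sign = +1 or -1."""
    return 0.5 * (phi11 - phi22) + sign * 1j * phi12

def phase_after_rotation(phi11, phi22, phi12, theta, sign):
    """Ratio of the rotated combination to the original one."""
    before = helicity_combination(phi11, phi22, phi12, sign)
    after = helicity_combination(*rotate_tensor(phi11, phi22, phi12, theta), sign)
    return after / before
```

For any nonzero starting combination the ratio has unit modulus and argument $\mp 2\theta$, twice the rotation angle, in contrast with the single power of the phase found for the electromagnetic field.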

We return to (8.8.1) and solve for the retarded gravitational radiation 
emitted by a known source $S^{\mu\nu}$:


p#”(x)=-— A [ax Gr(x ~ x') SPX’). (8.8.34) 
4n
$G_R$ is given in (4.2.14). For a monochromatic source,


SH” = SH”(r, w) ee + cc., 


we have, for large r, 


Heg (8.8.35) 
with 


pei À | dr’ e7™" S#"(r' w) = S""(k), (8.8.36) 


and k,,S#"(k) = 3k’S3(k). 
The gravitational stress tensor 7$, is calculated to lowest order from 
the last term in (8.7.11): 


(2) 
T8 = 7 k - oa | (8.8.37) 


Now (8u R)P = Na RO + 2A6,,.R™; since R?( 6) = 0, we drop it. 
Our 7%” is then given by 


(Ror z Tes R®), (8.8.38) 


2) 


with RẸ} given by (8.7.17). 

As usual, we average 7, over a cycle of the radiation. In evaluating 
the expression (8.7.17) for Tax, we may then substitute ež, for ¢,, on 
the left and €, for ¢,., on the right, finally adding the complex conjugate. 
With d, — iku, we have, in harmonic gauge, with kak“ = 0 

(2) 


4n)?r? 1 
Ar E (6 okyuky — Kakulon — Kakn€ue + kako€un) 


l 1 
— z kukne * ere + gee + kyen* — Teža) 


x (kach + kyen = Sig) + cc. (8.8.39) 


* 
= x ukal Exot the =| +c.c, (8.8.40)
Evidently, the trace R® of (8.8.40) is zero. Thus, we have for 7*”


MEN 2 
ren = 7 > 5 fener — ict}, (8.8.41) 
myr 


The expression for 7”” is easily seen to be gauge-invariant (for e“” 
in the harmonic gauge). Since we can choose e = 0 and ew = 0, (8.8.41) 
shows that the energy of the free field is positive. 

The gravitational Poynting vector P, follows from (8.8.41). Since the 


only amplitudes that differ from zero are €11, €22 = — €11 and €21, we have 
2k 5 E j 
e= Cape l + | €22| + lel] } (8.8. ) 
2k°k {isu -= enl | 
= <"-+ 2le 8.8.43 
(4n)°r? J jel ( ) 


and the energy radiated per unit time and solid angle in the three-direction 
is 


d 2k”k jeze 


dtd 2 


dtd) (4ny'r° * Ziel. (8.8.44) 


To find the angular distribution of the emitted radiation, we must express 
dW/dt dQ. in invariant terms: We find 


dW _ 2(k°)’ 
dtd) (4m) 


1 1 A A A A 
|- zlet + le; |? + sles kikil Te 2€ jk j€izk, 


+ euk bet + ethkeu (8.8.45) 


which, with $\mathbf{k}$ in the three-direction, yields (8.8.44). The total rate is given
by integrating (8.8.45) over $d\Omega$. With


— 6 (8.8.46) 


and 
the result is


dW 2(k°)? E( 1 2) 
= x jen i]eul). 8. 
dt 4m ag 3 eel (848) 


We note here [as is already clear from (8.8.44)] that (8.8.48) responds
only to the quadrupole (i.e., traceless) components of $e_{ij}$, since, with

$$e_{ij} = q_{ij} + \tfrac{1}{3}\,\delta_{ij}\, e_{kk} \tag{8.8.49}$$

and $q_{ii} = 0$,

$$e^{*}_{ij} e_{ij} - \tfrac{1}{3}\,|e_{kk}|^{2} = q^{*}_{ij} q_{ij}. \tag{8.8.50}$$


Therefore, turning to (8.8.36), we have

$$e_{ij} = -\lambda \int d\mathbf{r}\; e^{-i\mathbf{k}\cdot\mathbf{r}}\, T_{ij}(\mathbf{r}, \omega). \tag{8.8.51}$$


For $kr' \ll 1$, we can express the integrals over $T_{ij}$ in terms of moments of
the energy density $T_{00}$. From the conservation of $T_{\mu\nu}$,

$$\partial^{\mu} T_{\mu\nu} = 0, \tag{8.8.52}$$

follows

$$-(k^{0})^{2}\, T_{00} = \partial_i \partial_j T_{ij} \tag{8.8.53}$$

and

$$\int d\mathbf{r}\; T_{ij} = -\frac{(k^{0})^{2}}{2}\, Q_{ij}, \tag{8.8.54}$$

where

$$Q_{ij} = \int d\mathbf{r}\; T_{00}\, x_i x_j, \tag{8.8.55}$$

with or without a subtracted trace. Also, although $Q_{ij}$ evidently depends
on the choice of origin of the coordinate system, it only depends on it
through the constant total energy, $\int d\mathbf{r}\, T_{00} = W$, and the center of energy
coordinate, $\int d\mathbf{r}\, T_{00}\, x_i$, which depends linearly on $t$; since we are
considering a finite frequency, neither of these terms will survive.
Putting it all together, we find for the total radiation rate:

$$\frac{dW}{dt} = \frac{2}{5}\, G\,(k^{0})^{6} \left\{ Q^{*}_{ij} Q_{ij} - \tfrac{1}{3}\,|Q_{ii}|^{2} \right\}. \tag{8.8.56}$$


At this point, we reinstate the velocity of light $c$:

$$\frac{dW}{dt} = \frac{2G\omega^{6}}{5c^{5}} \left\{ Q^{*}_{ij} Q_{ij} - \tfrac{1}{3}\,|Q_{ii}|^{2} \right\}, \tag{8.8.57}$$

with $\omega$ the circular frequency of the radiation.
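
As a rough numerical illustration of (8.8.57), the sketch below evaluates the bracket $Q^{*}_{ij}Q_{ij} - \tfrac{1}{3}|Q_{ii}|^{2}$ for a given amplitude matrix and confirms that only the traceless part contributes. The function name, the sample matrices, and the rounded SI values of $G$ and $c$ are assumptions made for this example; the convention $Q_{ij}(t) = Q_{ij}e^{-i\omega t} + \text{c.c.}$ is taken from the monochromatic source used in this section.

```python
# Minimal sketch of (8.8.57). Names and constants are illustrative assumptions.
G = 6.674e-11   # gravitational constant in m^3 kg^-1 s^-2 (rounded)
C = 2.998e8     # speed of light in m/s (rounded)

def quadrupole_power(Q, omega):
    """Cycle-averaged radiated power for Q_ij(t) = Q_ij exp(-i omega t) + c.c.

    Q is a 3x3 nested list of (possibly complex) amplitudes in kg m^2 and
    omega is the circular frequency in rad/s. The result is in watts.
    """
    contraction = sum(abs(Q[i][j]) ** 2 for i in range(3) for j in range(3))
    trace = sum(Q[i][i] for i in range(3))
    bracket = contraction - abs(trace) ** 2 / 3.0
    return 2.0 * G * omega ** 6 / (5.0 * C ** 5) * bracket

# A traceless amplitude, the same amplitude with a multiple of delta_ij added,
# and a pure-trace amplitude.
traceless = [[1.0, 0.0, 0.0], [0.0, -1.0, 0.0], [0.0, 0.0, 0.0]]
shifted = [[traceless[i][j] + (5.0 if i == j else 0.0) for j in range(3)]
           for i in range(3)]
pure_trace = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]

print(quadrupole_power(traceless, 10.0))
print(quadrupole_power(shifted, 10.0))
print(quadrupole_power(pure_trace, 10.0))
```

Adding a multiple of $\delta_{ij}$ to $Q_{ij}$ leaves the rate unchanged, which is the statement that the definition (8.8.55) may be used "with or without a subtracted trace."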


CHAPTER 8 PROBLEMS 


8.1 The weak field gravitational red shift follows directly (both classi- 
cally and in quantum theory) from the equivalence principle. Show 
this by considering the emission and absorption of light, as follows: 


(a) Consider an emitter A and absorber B, both at rest, separated
    by a distance L along a uniform gravitational field. Describe
    the emission by A and absorption by B of a light signal from an
    appropriately accelerated coordinate system. In that coordinate
    system, the velocity of B when the light is absorbed will differ
    from the velocity of A when the light is emitted; the light will
    therefore be Doppler-shifted. Show that the Doppler shift
    agrees with the expected gravitational red shift.

(b) Consider an atom on the ground in a state of definite mass $M_1$.
    The atom emits a photon of frequency $\omega$ and goes to a state of
    mass $M_2$. The photon, now with frequency $\omega'$, is absorbed by
    an atom with mass $M_3$ at a height L. The atom now goes to a
    state of mass $M_4$. Using energy conservation, show that $\omega'$ has
    the correct red shift.


8.2 Examine the effect of a rotating source on the gravitational field.
Consider a set of particles of equal mass m making up a rigid
sphere that is rotating with angular velocity $\boldsymbol{\omega}$. The linear velocity
of a particle in the sphere is then

$$\mathbf{v}_a = \boldsymbol{\omega} \times \mathbf{r}_a,$$

and the first-order (in $\omega$) addition to the stress tensor is an
off-diagonal component $T_{0i}$.

For a spherically symmetric distribution of matter, the angular
momentum of matter $\mathbf{J}$ of the sphere is given by

$$\mathbf{J} = I\boldsymbol{\omega}$$

where $I$ is the moment of inertia of the sphere

$$I = \frac{2}{3} \int d\mathbf{r}\; r^{2} \rho$$

with $\rho$ the spherically symmetric density of matter in the sphere.
Now find the new gravitational potential outside the sphere. It
is given by


*8.3 (a) One suggested way of detecting the new potential would be to
measure the precession of the spin of a gyroscope in an orbit
around Earth.

Calculate the effect, starting from the last equation in Problem
8.2, keeping only the leading terms in $v/c$ and $\phi_{0i}$ in the equation
for $d^{2}\mathbf{x}/dt^{2}$. Proceed by considering each point mass $b$ in the
gyroscope to be at a position $\mathbf{x}_b = \mathbf{x} + \mathbf{y}_b$, where $\sum_b \mathbf{y}_b = 0$.

Expand in $\mathbf{y}_b$ to include only linear terms in $\mathbf{y}_b$ (this is legitimate,
since

$$\frac{y}{r} = \frac{\text{size of gyroscope}}{\text{radius of orbit}} \ll 1 \quad\text{and}\quad \frac{\dot{y}}{\dot{r}} = \frac{\text{spin velocity of gyroscope}}{\text{velocity in orbit}} \ll 1).$$

Now sum over $b$. The $\mathbf{y}_b$ and $\dot{\mathbf{y}}_b$ terms disappear and the internal
forces cancel by momentum conservation, leaving an equation of
motion for the center of the gyroscope. Subtracting this equation
from the original ones gives an equation for each $d^{2}\mathbf{y}_b/dt^{2}$ in terms
of the external gravitational field (Earth's field) and the internal
forces.

Now calculate 


The internal forces holding the gyroscope together cancel again,
this time by angular momentum conservation, leaving a right-hand
side having terms bilinear in $\mathbf{y}$ and $\dot{\mathbf{y}}$, and terms quadratic in $\mathbf{y}$.
Assuming a spherically symmetric gyroscope, express the first in
terms of the angular momentum $\mathbf{L}$ itself; then show that the second
is a time derivative. You should find


where / is the moment of inertia of the sphere, 


x 
q= 2GV x ge) 
F 
and $\boldsymbol{\Omega} = \mathbf{q}/2$.

The time derivative makes no contribution in a periodic orbit
(or, for that matter, in any confined orbit, averaged over a long
time). The final result is thus

$$\frac{d\mathbf{L}}{dt} = \boldsymbol{\Omega} \times \mathbf{L},$$

showing that $\mathbf{L}$ precesses about the vector $\boldsymbol{\Omega}$ with angular velocity
$\Omega$. This precession is called the Lense-Thirring effect.

Remember: The spherical symmetry of the gyroscope yields the 
identities 


> xix; X Ôj 
a 
and 
g DY xix? =0 
dia 


(b) Calculate the precession rate $\Omega$ in relevant units for a satellite
experiment: arc-sec/year. Take the satellite at one Earth radius
in a polar orbit.

The effect you have just calculated is of special interest, since 
it is distantly related to Mach’s principle, which to some extent is 
embodied in Einstein’s gravitational theory. Mach proposed that 
inertial frames are those frames that are at rest, or in uniform 
motion, with respect to the total matter in the universe. Thus, 
proximity to a large mass must distort the choice of inertial frame; 
indeed, we know that to be the case, since the curvature tensor is 
not zero near a gravitational source. 

Near a large rotating source, the transformation to an inertial
frame should involve a rotation, so that a gyroscope would be
expected to precess about the rotation vector of the source. Your
result shows that this is not exactly true, since the precession vector
$\boldsymbol{\Omega}$ is not in the direction of $\mathbf{J}$; however, the smallness of the effect
shows that the mass of the Earth is much too small to compete with
the large-scale matter in the universe. Presumably, a large enough
mass, properly configured and rotating, would drag the inertial
system around with it.


*8.4 (a) Starting from the equation of motion for a particle in a gravi- 
tational field, 


d? H A v 
“> + Oe ai do = Vg,,dx"dx” (1) 
do? do do Bu 


transform to a new independent variable $d\tau$. With $v^{\mu} = dx^{\mu}/d\tau$,
show that the new equation of motion is


dut Pio gg dv“ Sue MIAT. 


- -TH v"và. (2) 
dT  v°gg,v" dT 2  v°’gyn_v" 


(b) Expand dv“/dr in characteristic vectors of the matrix 
MË = v“ uP gaplV Bont”. One of these is v“, with a characteris- 
tic value of 1; call the three others, with a characteristic value 
of 0, 7. 

(c) Show that the 7 can be chosen as 

ni =8 Puig 
where the three u;’s are orthogonal to v* and to each other: 
V” Uiu =0 


Ne Je Se 
Up Uin = — 6; (note minus sign!) 


Equation (2) now becomes, with dv“/dr = gu" + 3; bini”, 


v” vv" vð Bap _ 


È bin“ = Tivo’, (3) 


2o Boat" 
with g remaining undetermined until a definite choice for $d\tau$
has been made.
The free choice of g arises from the circumstance that the 
four components of (1) are not independent: 
n i 
ss 29 constant = 1. 
do do 
Since g may be freely chosen, only three of the resulting 
equations are independent. 
(d) Two obvious choices for $\tau$ are time and proper time. Show how
g must be determined for each case. We will then continue with
$\tau$ chosen as the proper time, since that will give us a covariant
set of equations: the gravitational equivalent of the Lorentz
force law.
(e)


8.5 (a) 


(b) 


(c) 


8.6 (a) 


(b) 


Show how to project the coefficients b; from (3) and find (after 
some work) the new equation of motion: 
H 
Mas (gt* — ug" ve) Ta, (4) 
dt 
where 
1 
Pe = 5 (OB oa t OAB av T rga) vV”. (5) 


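Note that the tensor $g^{\mu\alpha}$ in (4) is not the tensor $g_{\mu\alpha}$ raised with
the Minkowski metric $\eta^{\mu\nu}$. It is the matrix inverse of $g_{\mu\nu}$:

$$g^{\mu\alpha} g_{\alpha\nu} = \delta^{\mu}_{\nu}.$$

Remember for future applications. For weak fields,
$g_{\mu\nu} = \eta_{\mu\nu} + 2\lambda\phi_{\mu\nu}$, but $g^{\mu\nu} = \eta^{\mu\nu} - 2\lambda\phi^{\mu\nu}$.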


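The covariant curvature tensor $R_{\lambda\mu\nu\sigma}$ is antisymmetric in $\lambda \leftrightarrow \mu$
and $\nu \leftrightarrow \sigma$; it is symmetric in $\lambda\mu \leftrightarrow \nu\sigma$. Show that these
conditions make $Q_{\lambda\mu\nu\sigma} = R_{\lambda\mu\nu\sigma} + R_{\lambda\nu\sigma\mu} + R_{\lambda\sigma\mu\nu}$ antisymmetric in
any pair of indices. Therefore, the equation $Q_{\lambda\mu\nu\sigma} = 0$ constitutes
an extra constraint on $R$ but only in dimensions four or
higher.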


From the symmetry properties of $R_{\lambda\mu\nu\sigma}$, find the number of
independent components of $R_{\lambda\mu\nu\sigma}$ in three, two, and one
dimension(s).


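Show that in two dimensions $R_{\lambda\mu\nu\sigma}$ can be expressed as

$$R_{\lambda\mu\nu\sigma} = \left(g_{\lambda\nu} g_{\mu\sigma} - g_{\lambda\sigma} g_{\mu\nu}\right) \frac{R_{1212}}{g},$$

and, therefore, the curvature $R = 2R_{1212}/g$ and
$R_{\lambda\mu\nu\sigma} = \left(g_{\lambda\nu} g_{\mu\sigma} - g_{\lambda\sigma} g_{\mu\nu}\right) R/2$.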


Extend the work of Section 7.7 to an arbitrary dimensionality
$d$. Calculate the number of conditions $N_c$ that must be imposed
in transforming the tensor $g_{\mu\nu}$ at $x = x_0$ to the form
$g_{\mu\nu} = \eta_{\mu\nu} + O((x - x_0)^{2})$. Now count the number of parameters $N_p$
available to carry out the transformation. Show that
$N_c = N_p - N_L$, where $N_L$ is the number of parameters of a Lorentz
transformation in $d$ dimensions.

Now, in one, two, and three dimensions, compare the number
$N_c$ of conditions imposed in transforming $g_{\mu\nu}$ to the form
$g_{\mu\nu} = \eta_{\mu\nu} + O((x - x_0)^{3})$ with the number of parameters $N_p$ available
to carry out the transformation. Show that
$N_c = N_p - N_L + N_R$, where $N_R$ is the number of independent
components of the curvature tensor that you found in Problem 8.5
above.


*8.7 Verify the statement made following (8.7.28) that justifies the
formula (8.7.27) for the gravitational action.


*8.8 Consider an electron bound to a proton in a relativistic Coulomb
orbit. The proton generates a potential $A_0 = -e/r$, $\mathbf{A} = 0$. Following
the technique of Section 8.5, find the orbit equation
$r = f(\theta, L, e^{2}, m)$, where $L$ is the electron's angular momentum.


8.9 (a) Show that in one dimension the radius of curvature of a curve
$y = f(x)$ at a point $x$ is $R = \left[1 + (y')^{2}\right]^{3/2}/y''$, where a negative
$R$ signifies that the curve is concave downward. The radius of
curvature is here defined as the radius of the tangent circle that
has the same second derivative at the matching point.


(b) A two-dimensional surface is embedded in three-dimensional
space. It is described by an equation $x_3 = f(x_1, x_2)$. Choose
$\partial f/\partial x_1$ and $\partial f/\partial x_2$ to be zero at $x_1 = x_2 = 0$. Then $f(x_1, x_2)$, near
$x_1, x_2 = 0$, can be expanded (with proper choice of axes) as

$$f(x_1, x_2) = \frac{x_1^{2}}{2R_1} + \frac{x_2^{2}}{2R_2} + O(x^{3}).$$

Calculate the metric tensor $g_{ij}$. (Note that $g_{ij} = g^{ij} = \delta_{ij}$ at
$x_1 = x_2 = 0$.)


(c) Calculate the curvature tensor $R_{1212}$ at $x_1 = x_2 = 0$ and from it
the curvature scalar $R$.
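8.10 Define

$$g_{\mu\nu} = \eta_{\alpha\beta}\, \frac{\partial \xi^{\alpha}}{\partial x^{\mu}} \frac{\partial \xi^{\beta}}{\partial x^{\nu}} \quad\text{and}\quad g^{\mu\nu} = \eta^{\alpha\beta}\, \frac{\partial x^{\mu}}{\partial \xi^{\alpha}} \frac{\partial x^{\nu}}{\partial \xi^{\beta}},$$

where $\eta$ is the usual 1, −1, −1, −1. Show that $g^{\mu\nu}$ is the inverse
matrix of $g_{\mu\nu}$.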


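8.11 Consider a two-dimensional space, with metric

$$(d\tau)^{2} = (dx_1)^{2} + f(x_1)\,(dx_2)^{2},$$

so

$$g_{11} = 1, \quad g_{22} = f(x_1), \quad g_{12} = g_{21} = 0.$$

Find the value of $\Gamma^{\mu}_{\alpha\lambda}$ for all $\mu$, $\alpha$, $\lambda$.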


8.12 (a) Consider a metric $g_{11} = 1$, $g_{22} = f(x_1)$. Calculate the curvature
tensor $R_{1212}$ and from it the scalar curvature $R$.
(b) Verify that for $f(x) = \sin^{2} x$, $R$ is constant. Verify that for
$f(x) = x^{2}$, $R$ is zero.
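8.13 In the harmonic gauge, $\partial_{\mu}\phi^{\mu\nu} = \tfrac{1}{2}\partial^{\nu}\phi$, the equation coupling $\phi^{\mu\nu}$
to $T^{\mu\nu}$ is

$$\partial_{\alpha}\partial^{\alpha}\phi^{\mu\nu} - \tfrac{1}{2}\,\eta^{\mu\nu}\,\partial_{\alpha}\partial^{\alpha}\phi = -\lambda T^{\mu\nu} \quad\text{or}\quad \partial_{\alpha}\partial^{\alpha}\phi^{\mu\nu} = -\lambda S^{\mu\nu} \tag{1}$$

where

$$S^{\mu\nu} = T^{\mu\nu} - \tfrac{1}{2}\,\eta^{\mu\nu}\, T.$$

Show that (1) is consistent with the choice of gauge, in that

$$\partial_{\mu} S^{\mu\nu} = \tfrac{1}{2}\,\partial^{\nu} S.$$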
APPENDIX A


Vectors and Tensors 


A.1. UNIT VECTORS AND ORTHOGONAL 
TRANSFORMATIONS 


A given orthogonal coordinate system determines an orthonormal set of 
unit vectors in terms of which an arbitrary vector can be expanded. In 
three dimensions, with $\hat{e}_1$, $\hat{e}_2$, and $\hat{e}_3$ as the unit vectors,

$$\mathbf{A} = \hat{e}_1 A_1 + \hat{e}_2 A_2 + \hat{e}_3 A_3. \tag{A.1.1}$$

The numbers $A_1$, $A_2$, and $A_3$ are called the 1, 2, 3 components of the
vector $\mathbf{A}$.

A useful simplification is achieved by introducing tensor notation, in 
which a subscript takes on successively the values 1, 2, 3. Thus, (A.1.1) 
would be written as 


$$\mathbf{A} = \sum_i \hat{e}_i A_i \tag{A.1.2}$$

or

$$\mathbf{A} = \hat{e}_i A_i \tag{A.1.3}$$


where in (A.1.3) the summation convention has been introduced. This 
convention requires that a repeated index be summed, unless otherwise 
stated. Notice that the dimensionality is now implicit. The tensor formal- 
ism allows us to deal simply with any (finite) dimension. 

The sum of two vectors $\mathbf{A}$ and $\mathbf{B}$ is defined as

$$\mathbf{A} + \mathbf{B} = \hat{e}_i (A_i + B_i).$$

The product of a scalar $c$ and vector $\mathbf{A}$ is defined as

$$c\mathbf{A} = \hat{e}_i (cA_i).$$


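The unit vectors are orthonormal. That is, we define the dot, or inner,
product by

$$\hat{e}_i \cdot \hat{e}_j = \delta_{ij} \tag{A.1.4}$$

where the Kronecker delta $\delta_{ij}$ is defined by

$$\delta_{ij} = \begin{cases} 1 & \text{if } i = j \\ 0 & \text{otherwise.} \end{cases} \tag{A.1.5}$$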


The dot, or inner, product of two vectors is defined to be distributive 
under addition. That is, 


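$$\mathbf{A}\cdot\mathbf{B} + \mathbf{A}\cdot\mathbf{C} = \mathbf{A}\cdot(\mathbf{B} + \mathbf{C}) \tag{A.1.6}$$

for any three vectors $\mathbf{A}$, $\mathbf{B}$, and $\mathbf{C}$. Therefore (remember the summation
convention),

$$\mathbf{A}\cdot\mathbf{B} = \hat{e}_i A_i \cdot \hat{e}_j B_j = \delta_{ij} A_i B_j = A_i B_i. \tag{A.1.7}$$

Although we have introduced vectors as abstract objects, in two or
three dimensions we can visualize them as directed line segments in our
very own Euclidean space. Then, in two dimensions, we would write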


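$$\mathbf{A} = A_x \hat{e}_x + A_y \hat{e}_y \tag{A.1.8}$$

and with $\theta$ the angle between $\mathbf{A}$ and the x-axis, $\hat{e}_x$,

$$A_x = A\cos\theta, \qquad A_y = A\sin\theta \tag{A.1.9}$$

and

$$A^{2} = A_x^{2} + A_y^{2} = \mathbf{A}\cdot\mathbf{A}. \tag{A.1.10}$$

A similar expansion for $\mathbf{B}$,

$$\mathbf{B} = B_x \hat{e}_x + B_y \hat{e}_y \tag{A.1.11}$$

with

$$B_x = B\cos\psi \quad\text{and}\quad B_y = B\sin\psi \tag{A.1.12}$$

gives

$$\mathbf{A}\cdot\mathbf{B} = AB(\cos\theta\cos\psi + \sin\theta\sin\psi) = AB\cos(\theta - \psi), \tag{A.1.13}$$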
the well-known relation of the inner product to the magnitudes of two 
vectors and the angle between them. 


We may consider a second orthogonal coordinate system with unit
vectors $\hat{e}'_i$. The new unit vectors can be expanded in terms of the old ones:

$$\hat{e}'_i = O_{ij}\, \hat{e}_j. \tag{A.1.14}$$

Since the $\hat{e}'_i$ must be orthonormal,

$$\delta_{ij} = \hat{e}'_i \cdot \hat{e}'_j = O_{ik} O_{jl}\, \hat{e}_k \cdot \hat{e}_l = O_{ik} O_{jk}. \tag{A.1.15}$$

In matrix notation, (A.1.15) can be written more compactly as

$$O O^{T} = I, \tag{A.1.16}$$

where the $ij$ matrix element of $I$ is $\delta_{ij}$ and the matrix elements of the
matrix $O$ are $O_{ij}$, and of its transpose $O^{T}$ are

$$(O^{T})_{ij} = O_{ji}. \tag{A.1.17}$$


A matrix satisfying (A.1.16) is called orthogonal, since it connects two 
orthogonal coordinate systems. 

The determinant of an orthogonal matrix must be $\pm 1$. To see this,
we take the determinant of (A.1.16). This yields

$$\det I = \det(O O^{T}) = (\det O)(\det O^{T}) = (\det O)^{2} \tag{A.1.18}$$

so that, since $\det I = 1$, $\det O = \pm 1$.
It follows immediately that $O$ has a unique inverse (as it must, since
we could have expressed the $\hat{e}_i$'s in terms of the $\hat{e}'_i$'s)

$$O^{-1} = O^{T} \tag{A.1.19}$$

so that also

$$O^{T} O = I. \tag{A.1.20}$$


It is geometrically evident that any reorientation of the three unit
coordinate vectors, keeping their relative positions unchanged, can be
generated by a rotation. Since any rotation can be generated by a
succession of small rotations, the determinant of the transformation matrix $O$
describing the rotation can never jump from 1 to −1. Therefore, when
$\det O = 1$, $O$ describes a rotation, whereas when $\det O = -1$, $O$ describes
a rotation followed by a single reflection, for example,

$$\hat{e}'_1 = \hat{e}_1, \quad \hat{e}'_2 = \hat{e}_2, \quad\text{and}\quad \hat{e}'_3 = -\hat{e}_3. \tag{A.1.21}$$
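
The statements (A.1.16) through (A.1.21) can be checked numerically. The following is a minimal sketch in plain Python, assuming a rotation about the 3-axis and a reflection of the 3-axis as sample transformations; the helper names are illustrative and not part of the text.

```python
import math

def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]

def det3(m):
    """Determinant of a 3x3 matrix by cofactor expansion along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def rotation_about_3(angle):
    """Rows are the new unit vectors e'_i = O_ij e_j for a rotation about axis 3."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

# Single reflection of the 3-axis, as in (A.1.21).
REFLECT_3 = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, -1.0]]

def is_orthogonal(o, tol=1e-12):
    """True when O O^T equals the identity to within tol, as in (A.1.16)."""
    p = matmul(o, transpose(o))
    return all(abs(p[i][j] - (1.0 if i == j else 0.0)) < tol
               for i in range(3) for j in range(3))

rot = rotation_about_3(0.7)
improper = matmul(REFLECT_3, rot)   # rotation followed by a reflection

print(is_orthogonal(rot), det3(rot))
print(is_orthogonal(improper), det3(improper))
```

Both matrices satisfy $OO^{T} = I$; only the sign of the determinant distinguishes the pure rotation from the rotation followed by a reflection.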


A.2. TRANSFORMATION OF VECTOR COMPONENTS 


When we transform the base vectors $\hat{e}_i$, we must transform the vector
components $A_i$ in such a way that the vector $\mathbf{A}$ itself remains unchanged:

$$\mathbf{A} = \hat{e}_i A_i = \hat{e}'_i A'_i. \tag{A.2.1}$$

From (A.1.14) we see that

$$\hat{e}_i A_i = O_{ki}\, \hat{e}_i A'_k \tag{A.2.2}$$

so that

$$A_i = O_{ki} A'_k \tag{A.2.3}$$

or, in matrix notation, with $A$ as a column vector,

$$A = O^{T} A' \tag{A.2.4}$$

or

$$A' = O A \tag{A.2.5}$$


so that the components $A_i$ transform exactly like the unit vectors $\hat{e}_i$. This
is a special property of orthogonal transformations. In general, the base
vectors transform differently from the vector components. Thus, if the
transformation on the $\hat{e}_i$'s is $P$, that is,

$$\hat{e}'_i = P_{ij}\, \hat{e}_j \tag{A.2.6}$$

and

$$\hat{e}'_i A'_i = \hat{e}_i A_i, \tag{A.2.7}$$

we again must have, as in (A.2.4),

$$A = P^{T} A' \tag{A.2.8}$$

from which, in general,

$$A' = (P^{T})^{-1} A \neq P A \tag{A.2.9}$$


except for orthogonal matrices. The transformation law of the base vectors 
is called covariant, that of the vector components contravariant. 

We define a vector under orthogonal transformations to be a set of 
objects A; that transform according to (A.2.5). The prototype vector is 
formed by the x, y, and z components of a point in space referred to some 
origin. 

The dot product of two vectors is independent of the coordinate 
system: 


$$\mathbf{A}'\cdot\mathbf{B}' = A'_i B'_i = O_{ij} O_{ik} A_j B_k = A_j B_j = \mathbf{A}\cdot\mathbf{B}. \tag{A.2.10}$$

We call $\mathbf{A}\cdot\mathbf{B}$ an invariant, or scalar. There are also, of course, trivial
scalars, such as fixed numbers, mass ratios, etc. Note that the product $SA_i$
of a scalar $S$ and a vector $A_i$ is a vector.

The importance of the transformation properties of vector components 
for physics is that a linear relation between vectors is preserved under 
coordinate transformations. Thus, if a physical law is 


$$A_i = B_i, \tag{A.2.11}$$

then obviously $O_{ji} A_i = O_{ji} B_i$, or

$$A'_j = B'_j \tag{A.2.12}$$


and the law is the same expressed in the new coordinate system. There- 
fore, no experiment can tell us which coordinate system we are using. We 
say this law of Nature is invariant (strictly, covariant) under rotations and 
reflections. A simple way of describing what this means is the following: 
Suppose we go to sleep in our laboratory and a playful genie turns our 
laboratory around. There is no way that we can on awakening tell whether 
and by how much we have turned. (Of course, there must be no nearby 
unturned objects that might serve as references.) Physicists believe that 
this property is exactly true of rotations: Space is isotropic. 

A reflection of our laboratory puts more of a strain on our genie: She 
would have to tear apart the laboratory and then reconstruct a mirror 
image of it and everything in it (including us). Then, if the laws of Nature 
are covariant with respect to reflections as well as to rotations, we would 
not be able to tell whether the deconstruction had taken place or not. Put
more simply and avoiding the painful deconstruction, we could not tell by 
watching an experiment whether we were looking at the real world, or at 
a reflection of the real world in a mirror. Unlike the case of rotations, 
reflection invariance holds to a very good approximation for most pheno- 
mena, but not exactly for any, and not even approximately for the weak 
interactions. 

A simple example of a vector law is Newton's second law describing
the acceleration of a particle of mass $m$ at position $x_i$ in the gravitational
field of a second particle of mass $M$ at position $y_i$:

$$m\,\frac{d^{2}x_i}{dt^{2}} = -\frac{mMG\,(x_i - y_i)}{\left[(x_k - y_k)^{2}\right]^{3/2}} \tag{A.2.13}$$

where $m$, $M$, $G$, $t$, and $\left[(x_k - y_k)^{2}\right]^{3/2}$ are scalars, and $x_i$ and $y_i$ are vectors.
Note that the summation convention defines $(x_k - y_k)^{2}$ to be

$$(x_k - y_k)^{2} = (x_k - y_k)(x_k - y_k) = \sum_k (x_k - y_k)(x_k - y_k), \tag{A.2.14}$$

the square of the distance between $\mathbf{x}$ and $\mathbf{y}$. Note also that $x_i(t)$ and
$x_i(t + \Delta t)$ are vectors and, thus,

$$\frac{dx_i}{dt} = \lim_{\Delta t \to 0} \frac{x_i(t + \Delta t) - x_i(t)}{\Delta t}$$

is also a vector, as is $d^{2}x_i/dt^{2}$, etc.


A.3. TENSORS 


We define an nth-rank tensor to be a set of objects that transform like
products of n vectors. Thus if $T_{i_1 \cdots i_n}$ is an nth-rank tensor,

$$T'_{i_1 \cdots i_n} = O_{i_1 j_1} O_{i_2 j_2} \cdots O_{i_n j_n}\, T_{j_1 \cdots j_n}. \tag{A.3.1}$$

A simple example of such a tensor is, in fact, a product of n vectors:

$$T_{i_1 \cdots i_n} = A_{i_1} B_{i_2} \cdots Z_{i_n} \tag{A.3.2}$$


which will obviously have the property (A.3.1). Note that a scalar is a 
tensor of rank zero and a vector a tensor of rank one. 
Symmetry properties of tensors are invariant. Thus, if $T_{i_1 i_2 \cdots}$ is
symmetric (antisymmetric) under an exchange of $i_1$ and $i_2$, then $T'_{i_1 i_2 \cdots}$ will
have the same property under exchange of $i_1$ and $i_2$.

We recall from (A.2.10) that the inner product $A_i B_i$ of two vectors
$A_i$ and $B_i$ is invariant. The generalization of this rule to tensors is that the
trace with respect to any two indices lowers the rank of the tensor by 2.
The trace with respect to two indices is defined as

$$\operatorname{Tr}_{i_j, i_k}\, T_{i_1 \cdots i_n} = T_{i_1 \cdots i \cdots i \cdots i_n} \tag{A.3.3}$$

where, as always, the repeated index $i$ is summed.

Laws of nature that are linear relations between tensors have the same 
property as those between vectors, in that they do not permit experiments 
to differentiate between coordinate systems. 

We give a simple example of a tensor law. Suppose for simplicity that
in (A.2.13) we let $M$ be a heavy immobile object at $\mathbf{y} = 0$. Then (A.2.13)
can be written as

$$m\,\frac{d^{2}x_i}{dt^{2}} = -\frac{mMG\,x_i}{(x_k^{2})^{3/2}}. \tag{A.3.4}$$

Now multiply (A.3.4) by $x_j$:

$$m\,x_j\,\frac{d^{2}x_i}{dt^{2}} = -\frac{mMG\,x_i x_j}{(x_k^{2})^{3/2}}, \tag{A.3.5}$$

interchange $i$ and $j$, and subtract. The right-hand side of (A.3.5) vanishes,
and we are left with

$$0 = m\left(x_i\,\frac{d^{2}x_j}{dt^{2}} - x_j\,\frac{d^{2}x_i}{dt^{2}}\right) = \frac{d}{dt}\, m\left(x_i\,\frac{dx_j}{dt} - x_j\,\frac{dx_i}{dt}\right) \tag{A.3.6}$$

and we have a tensor conservation law:

$$\frac{d}{dt}\, L_{ij} = 0 \tag{A.3.7}$$

where

$$L_{ij} = m\left(x_i\,\frac{dx_j}{dt} - x_j\,\frac{dx_i}{dt}\right). \tag{A.3.8}$$

Equation (A.3.7) is a new way of writing the conservation of angular
momentum as the constancy of a second-rank antisymmetric tensor.


A.4. PSEUDOTENSORS 


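The connection between the second-rank antisymmetric tensor $L_{ij}$ (A.3.8)
and the usual angular momentum vector

$$\mathbf{L} = m\,\mathbf{x} \times \mathbf{v} \tag{A.4.1}$$

is given by the equation

$$L_i = \tfrac{1}{2}\,\epsilon_{ijk} L_{jk} \tag{A.4.2}$$

where $\epsilon_{ijk}$ is totally antisymmetric in exchange of any two indices, and
$\epsilon_{123} = 1$. This uniquely determines all components of $\epsilon$:

$$1 = \epsilon_{123} = \epsilon_{312} = \epsilon_{231} = -\epsilon_{213} = -\epsilon_{132} = -\epsilon_{321}; \tag{A.4.3}$$

all other components are zero.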
One might suppose that $\epsilon$ is a third-rank totally antisymmetric tensor.
To check, we calculate

$$\epsilon'_{ijk} = O_{il} O_{jm} O_{kn}\, \epsilon_{lmn}. \tag{A.4.4}$$

Since $\epsilon'_{ijk}$ is totally antisymmetric in exchange of any two indices, $\epsilon'$
must be a multiple of $\epsilon$. We can evaluate the multiple by calculating
$\epsilon'_{123}$:

$$\epsilon'_{123} = O_{1i} O_{2j} O_{3k}\, \epsilon_{ijk} = \det O \tag{A.4.5}$$

so that $\epsilon$ is a tensor under rotations but has an extra change of sign under
reflections. It is called a pseudotensor. In contrast, as shown in Problem
(A.1), $\delta_{ij}$ is a tensor.
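
The result (A.4.5) can be verified directly by carrying out the sums in (A.4.4). The sketch below assumes a rotation about the 3-axis and the full inversion $O = -I$ as test cases; the closed-form expression used for $\epsilon_{ijk}$ (with indices running over 0, 1, 2) and all function names are illustrative choices, not the text's.

```python
import math

def levi_civita(i, j, k):
    """epsilon_ijk for indices in {0, 1, 2}: +1, -1, or 0."""
    return (i - j) * (j - k) * (k - i) // 2

def det3(m):
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def transformed_epsilon(o):
    """epsilon'_ijk = O_il O_jm O_kn epsilon_lmn, as in (A.4.4)."""
    idx = range(3)
    return [[[sum(o[i][l] * o[j][m] * o[k][n] * levi_civita(l, m, n)
                  for l in idx for m in idx for n in idx)
              for k in idx] for j in idx] for i in idx]

def equals_det_times_epsilon(o, tol=1e-12):
    """Check epsilon'_ijk = (det O) epsilon_ijk for every component."""
    eps_prime = transformed_epsilon(o)
    d = det3(o)
    return all(abs(eps_prime[i][j][k] - d * levi_civita(i, j, k)) < tol
               for i in range(3) for j in range(3) for k in range(3))

c, s = math.cos(0.7), math.sin(0.7)
ROTATION = [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]       # det = +1
INVERSION = [[-1.0, 0.0, 0.0], [0.0, -1.0, 0.0], [0.0, 0.0, -1.0]]  # det = -1

print(equals_det_times_epsilon(ROTATION), det3(ROTATION))
print(equals_det_times_epsilon(INVERSION), det3(INVERSION))
```

For the inversion every transformed component changes sign, which is the extra sign that separates a pseudotensor from a tensor.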

Returning to (A.4.2), we see that L; transforms like a vector under 
rotations, but has an extra sign change under inversions. It is called a 
pseudovector, or axial vector. This property corresponds exactly to that
of an ordinary vector cross-product, as in (A.4.1): Note that an inversion
($\mathbf{x} \to -\mathbf{x}$) takes $\mathbf{v} \to -\mathbf{v}$, but $\mathbf{L} \to \mathbf{L}$, which is not what a true vector should
do.

The pseudotensor $\epsilon_{ijk}$ evidently makes possible the construction of
pseudotensors of any rank. For example, if $\mathbf{A}$, $\mathbf{B}$, and $\mathbf{C}$ are vectors, then

$$\mathbf{A}\cdot\mathbf{B}\times\mathbf{C} = \epsilon_{ijk} A_i B_j C_k \tag{A.4.6}$$

is a pseudoscalar.
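
A short sketch of the behavior under inversion described above: the cross product built from $\epsilon_{ijk}$ is unchanged when both factors are inverted, while the triple product (A.4.6) changes sign. The sample vectors and function names are assumptions made for the example.

```python
def levi_civita(i, j, k):
    """epsilon_ijk for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2

def cross(a, b):
    """(A x B)_i = epsilon_ijk A_j B_k."""
    return [sum(levi_civita(i, j, k) * a[j] * b[k]
                for j in range(3) for k in range(3)) for i in range(3)]

def triple(a, b, c):
    """A . B x C = epsilon_ijk A_i B_j C_k, as in (A.4.6)."""
    bc = cross(b, c)
    return sum(a[i] * bc[i] for i in range(3))

def invert(v):
    """Inversion x -> -x applied to the components of a true vector."""
    return [-x for x in v]

A = [1.0, 2.0, 3.0]
B = [-1.0, 0.5, 2.0]
C = [0.0, 1.0, -1.0]

print(cross(A, B), cross(invert(A), invert(B)))          # pseudovector: same
print(triple(A, B, C), triple(invert(A), invert(B), invert(C)))  # sign flips
```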


A.5. VECTOR AND TENSOR FIELDS 


In the previous sections of this appendix, we have considered single
tensors, vectors, and scalars. Examples are the angular momentum, the
velocity and the energy of a body at some time, or the instantaneous electric
field at a given point in space.

We often wish to consider collectively objects associated with all of
space, for example, the density $\rho(x)$ of matter at a given time. Since mass
and volume are scalars, $\rho(x)$, the mass per unit volume, is also a scalar.
We call $\rho(x)$ a scalar field. Similarly, we can consider vector fields, such
as the electric field, and, in general, tensor and pseudotensor fields of
higher rank.

Let us start with a scalar field $\rho(x)$. The transformation rule for $\rho(x)$
is

$$\rho'(x') = \rho(x) = \rho(x(x')). \tag{A.5.1}$$

That is, the densities measured at the same point in both the primed and
unprimed coordinate systems are equal. In the primed system, we express
the density as a function of the primed coordinate $x'$. Therefore, the new
function $\rho'(x')$ is not the same function of $x'$ as $\rho$ is of $x$. Hence, the
notation $\rho'(x')$ in (A.5.1).

Differentiation of a scalar field produces a vector field. To see this,
let $\rho$ be the scalar field. Then consider the gradient

$$A_i(x) = \frac{\partial}{\partial x_i}\, \rho(x). \tag{A.5.2}$$


The instructions for calculating the gradient in the primed coordinate
system would be, by the chain rule for partial differentiation,

$$A'_i(x') = \frac{\partial}{\partial x'_i}\, \rho'(x') = \frac{\partial x_j}{\partial x'_i}\, \frac{\partial}{\partial x_j}\, \rho(x). \tag{A.5.3}$$

Recall that $x'_i = O_{ij} x_j$, so

$$x_j = (O^{T})_{ji}\, x'_i, \qquad \frac{\partial x_j}{\partial x'_i} = (O^{T})_{ji} = O_{ij},$$

and therefore,

$$A'_i(x') = O_{ij} A_j(x),$$

which is the correct transformation law for a vector.
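
The chain-rule argument can be illustrated numerically. The sketch below picks an arbitrary quadratic scalar field and a rotation about the 3-axis (both assumptions for the example), builds $\rho'(x') = \rho(x(x'))$, differentiates it by central differences in the primed coordinates, and compares the result with $O_{ij}A_j(x)$.

```python
import math

def rotation_about_3(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

def apply(o, v):
    """Components (O v)_i = O_ij v_j."""
    return [sum(o[i][j] * v[j] for j in range(3)) for i in range(3)]

def apply_transpose(o, v):
    """Components (O^T v)_i = O_ji v_j."""
    return [sum(o[j][i] * v[j] for j in range(3)) for i in range(3)]

def rho(x):
    """Sample scalar field (an arbitrary choice for this sketch)."""
    return x[0] ** 2 + 2.0 * x[1] + x[0] * x[2]

def grad_rho(x):
    """Analytic gradient A_i = d rho / d x_i of the sample field."""
    return [2.0 * x[0] + x[2], 2.0, x[0]]

def numerical_gradient(f, x, step=1e-5):
    """Central-difference gradient of f at x."""
    grad = []
    for i in range(3):
        up, down = list(x), list(x)
        up[i] += step
        down[i] -= step
        grad.append((f(up) - f(down)) / (2.0 * step))
    return grad

O = rotation_about_3(0.4)

def rho_primed(x_primed):
    """rho'(x') = rho(x(x')) with x = O^T x', as in (A.5.1)."""
    return rho(apply_transpose(O, x_primed))

x = [0.3, -1.2, 0.8]
x_primed = apply(O, x)

gradient_in_primed_system = numerical_gradient(rho_primed, x_primed)
rotated_gradient = apply(O, grad_rho(x))
print(gradient_in_primed_system)
print(rotated_gradient)
```

The two printed lists agree to within the finite-difference error, as the vector transformation law requires.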


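In general, differentiation of a tensor of rank n produces a tensor of
rank n + 1.¹ For example,

$$B_{ij} = \frac{\partial}{\partial x_i}\, A_j,$$

where $A_j$ is a vector, produces a second-rank tensor $B_{ij}$. Note that the
antisymmetric tensor

$$F_{ij} = B_{ij} - B_{ji} = \frac{\partial}{\partial x_i}\, A_j - \frac{\partial}{\partial x_j}\, A_i$$

is related to the curl of $\mathbf{A}$:

$$\tfrac{1}{2}\,\epsilon_{ijk} F_{jk} = (\nabla \times \mathbf{A})_i. \tag{A.5.5}$$

Since $(\partial/\partial x_i) A_j$ is a tensor, its trace, $(\partial/\partial x_i) A_i$, is a scalar. Similarly,
the divergence of a tensor of rank n produces one of rank n − 1: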


$$\frac{\partial}{\partial x_{i_1}}\, F_{i_1 i_2 \cdots i_n} = G_{i_2 \cdots i_n}. \tag{A.5.6}$$


¹Warning: We have shown this to be true for orthogonal transformations. It is
specifically not true for general coordinate transformations. (See Section 7.6.)
A.6. SUMMARY OF RULES OF THREE-DIMENSIONAL
VECTOR ALGEBRA AND ANALYSIS


We list here, without proof, the most important of these.²

1. For any vectors $\mathbf{A}$ and $\mathbf{B}$

   $\mathbf{A}\cdot\mathbf{B} = AB\cos\theta$ (A.6.1)

   where $A$ is the magnitude of $\mathbf{A}$,

   $A = (A_i^{2})^{1/2},$

   $B$ the magnitude of $\mathbf{B}$, and $\theta$ the angle between them.

2. $\mathbf{A}\times\mathbf{B} = \hat{n}\,AB\sin\theta$ (A.6.2)

   where $\hat{n}$ is normal to the plane containing $\mathbf{A}$ and $\mathbf{B}$, and in a
   direction given by a right-handed screw going from $\mathbf{A}$ to $\mathbf{B}$ (through
   the smallest angle between them). Right-handed is defined here by
   the coordinate system in which (A.4.2) is written: $\hat{e}_1 \times \hat{e}_2 = \hat{e}_3$.


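3. $\mathbf{A}\times(\mathbf{B}\times\mathbf{C}) = \mathbf{A}\cdot\mathbf{C}\,\mathbf{B} - \mathbf{A}\cdot\mathbf{B}\,\mathbf{C}$ (A.6.3)

   provided $\mathbf{B}$ and $\mathbf{C}$ commute.

4. $\mathbf{A}\cdot\mathbf{B}\times\mathbf{C} = \mathbf{A}\times\mathbf{B}\cdot\mathbf{C} = \det\begin{pmatrix} A_x & A_y & A_z \\ B_x & B_y & B_z \\ C_x & C_y & C_z \end{pmatrix}$ (A.6.4)

5. Stokes' theorem:

   $\oint_C d\boldsymbol{\ell}\cdot\mathbf{A} = \int_S d\mathbf{S}\cdot\nabla\times\mathbf{A}.$ (A.6.5)

   Here, $C$ is a closed path along which we take the line integral of
   $\mathbf{A}$, $S$ is a surface bounded by $C$, and $d\mathbf{S} = dS\,\hat{n}$, with $dS$ the surface
   element and $\hat{n}$ a unit vector normal to the surface, in a direction
   given by a right-handed screw going around the circuit in the
   direction specified by $d\boldsymbol{\ell}$.

²For a simple discussion, see H. M. Schey, Div, Grad, Curl and All That, New York:
W. W. Norton, 1973.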


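6. Gauss' theorem:

   $\int_V d\mathbf{r}\;\nabla\cdot\mathbf{A} = \oint_S d\mathbf{S}\cdot\mathbf{A}$ (A.6.6)

   where $V$ is a volume bounded by $S$, and $d\mathbf{S} = dS\,\hat{n}$, as before, with
   $\hat{n}$ the outward normal.

7. A vector $\mathbf{E}$ with $\nabla\times\mathbf{E} = 0$ is the gradient of a scalar:

   $\mathbf{E} = \nabla\phi.$ (A.6.7)

8. A vector $\mathbf{B}$ with $\nabla\cdot\mathbf{B} = 0$ is the curl of a vector:

   $\mathbf{B} = \nabla\times\mathbf{A}.$ (A.6.8)

9. The gradient operator in a general orthogonal coordinate system
   $q_1$, $q_2$, and $q_3$ with unit vectors $\hat{e}_1$, $\hat{e}_2$, and $\hat{e}_3$ is given by

   $$\nabla = \frac{\hat{e}_1}{h_1}\frac{\partial}{\partial q_1} + \frac{\hat{e}_2}{h_2}\frac{\partial}{\partial q_2} + \frac{\hat{e}_3}{h_3}\frac{\partial}{\partial q_3}$$

   where $h_i\,dq_i$ is the increment of length in the $q_i$ direction. For
   explicit calculation, one must remember that the $h$'s and $\hat{e}$'s are, in
   general, functions of the $q$'s. In particular, the Laplacian operator
   $\nabla^{2}$ is given by

   $$\nabla^{2} = \frac{1}{h_1 h_2 h_3}\left[\frac{\partial}{\partial q_1}\left(\frac{h_2 h_3}{h_1}\frac{\partial}{\partial q_1}\right) + \frac{\partial}{\partial q_2}\left(\frac{h_3 h_1}{h_2}\frac{\partial}{\partial q_2}\right) + \frac{\partial}{\partial q_3}\left(\frac{h_1 h_2}{h_3}\frac{\partial}{\partial q_3}\right)\right]. \tag{A.6.9}$$

   In spherical coordinates $r$, $\theta$, $\varphi$: $h_r = 1$, $h_\theta = r$, and $h_\varphi = r\sin\theta$.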
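
As a hedged illustration of (A.6.9), the sketch below evaluates the curvilinear Laplacian by finite differences using the spherical scale factors quoted above, and tests it on two functions whose Laplacians are known: $r^{2}$ (Laplacian 6) and $x^{2} - y^{2} = r^{2}\sin^{2}\theta\cos 2\varphi$ (Laplacian 0). The discretization, step size, and function names are assumptions of this example, not part of the text.

```python
import math

def spherical_scale_factors(q):
    """Scale factors (h_r, h_theta, h_phi) at q = (r, theta, phi)."""
    r, theta, _phi = q
    return (1.0, r, r * math.sin(theta))

def laplacian(f, q, scale_factors, step=1e-3):
    """Finite-difference version of (A.6.9) at the point q = (q1, q2, q3)."""
    def weight(point, i):
        # h1 h2 h3 / h_i^2, i.e. h2 h3 / h1 for i = 0, and cyclic.
        h = scale_factors(point)
        return h[0] * h[1] * h[2] / h[i] ** 2

    def shifted(i, delta):
        point = list(q)
        point[i] += delta
        return tuple(point)

    total = 0.0
    for i in range(3):
        # Flux-like terms evaluated half a step to either side of q.
        forward = weight(shifted(i, step / 2), i) * (f(shifted(i, step)) - f(q)) / step
        backward = weight(shifted(i, -step / 2), i) * (f(q) - f(shifted(i, -step))) / step
        total += (forward - backward) / step
    h = scale_factors(q)
    return total / (h[0] * h[1] * h[2])

def r_squared(q):
    return q[0] ** 2

def x2_minus_y2(q):
    r, theta, phi = q
    return (r * math.sin(theta)) ** 2 * math.cos(2.0 * phi)

point = (1.3, 0.9, 0.4)
print(laplacian(r_squared, point, spherical_scale_factors))
print(laplacian(x2_minus_y2, point, spherical_scale_factors))
```

Other orthogonal systems can be tried by passing a different scale-factor function, for example one returning `(1.0, q[0], 1.0)` for cylindrical coordinates.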


APPENDIX A: PROBLEMS 


A.1 Show that $\delta_{ij}$ is a second-rank tensor.


A.2 Construct a second-rank tensor $T_{ij}$ that is symmetric, has zero trace,
and is a function of only one vector and the Kronecker delta.


A.3 Repeat Problem A.2, but for a third-rank tensor.


A.4 Repeat Problem A.2, but for a fourth-rank tensor.


A.5 Construct the most general third-rank tensor $T_{ijk}$ that is a linear
function of a vector $\mathbf{y}$, a bilinear function of a vector $\mathbf{x}$, and
antisymmetric under the exchange of $i$ and $j$.


A.6 Construct a third-rank tensor $T_{ijk}$ that is a function of two vectors,
$x_i$ and $y_i$, antisymmetric under exchange of $i$ and $j$, and symmetric
under exchange of $\mathbf{x}$ and $\mathbf{y}$.


A.7 Show that the triple scalar product, $\mathbf{A}\cdot\mathbf{B}\times\mathbf{C}$, is given by

$$\mathbf{A}\cdot\mathbf{B}\times\mathbf{C} = \det\begin{pmatrix} A_x & A_y & A_z \\ B_x & B_y & B_z \\ C_x & C_y & C_z \end{pmatrix},$$

from which you must show that $\mathbf{A}\cdot\mathbf{B}\times\mathbf{C}$ is invariant under an
interchange of the dot and cross-product, that is,

$$\mathbf{A}\cdot\mathbf{B}\times\mathbf{C} = \mathbf{A}\times\mathbf{B}\cdot\mathbf{C}.$$


A.8 Using (A.6.3), show that the triple vector product is not, in general,
associative, that is,

$$\mathbf{A}\times(\mathbf{B}\times\mathbf{C}) \neq (\mathbf{A}\times\mathbf{B})\times\mathbf{C},$$

and find under what circumstances the inequality becomes an
equality.


A.9 Given a tensor $T_{ijkl}$, symmetric (or antisymmetric) in $i$ and $j$, prove
that the transformed tensor $T'_{i'j'k'l'}$ has the same property in $i'$ and
$j'$.


A.10 Let $T_{ijk}$ be a third-rank tensor. Show that $T_{iji}$ is a first-rank tensor.


A.11 Show that $\epsilon_{ijk}\epsilon_{ilm} = \delta_{jl}\delta_{km} - \delta_{jm}\delta_{kl}$.


A.12 Use the result of Problem A.11 to prove the formula (A.6.3) for
the triple vector product.


A.13 Use the definition of $\mathbf{A}\times\mathbf{B}$:

$$(\mathbf{A}\times\mathbf{B})_i = \epsilon_{ijk} A_j B_k,$$

to show that $\mathbf{A}\times\mathbf{B}$ is orthogonal to $\mathbf{A}$ and $\mathbf{B}$.


A.14 Use the result of Problem A.11 to show that the magnitude of $\mathbf{A}\times\mathbf{B}$
is given correctly by (A.6.2).


A.15 Show that the direction of $\mathbf{A}\times\mathbf{B}$, as defined in Problem A.13,
satisfies the right-hand rule.

A.16 Suppose a body is spinning about a fixed point with instantaneous
angular velocity $\boldsymbol{\omega}$. That is, every point $\mathbf{r}_b$ in the body has
instantaneous velocity $\mathbf{v}_b = \boldsymbol{\omega}\times\mathbf{r}_b$. The angular momentum $\mathbf{L}$ of the body
is

$$\mathbf{L} = \sum_b m_b\, \mathbf{r}_b \times (\boldsymbol{\omega}\times\mathbf{r}_b).$$

Show that the components of $\mathbf{L}$ are linear functions of the
components of $\boldsymbol{\omega}$:

$$L_i = I_{ij}\,\omega_j$$

where $I_{ij}$ is called the moment of inertia tensor. Show that $I_{ij}$ is a
second-rank symmetric tensor.


A.17 Using the fact [derived in (A.1.13)] that $\mathbf{A}\cdot\mathbf{B}$ for any two vectors
is given by $AB\cos\theta$, where $\theta$ is the angle between them, show that

$$\cos\theta = \cos\theta_A\cos\theta_B + \sin\theta_A\sin\theta_B\cos(\varphi_A - \varphi_B)$$

where $\theta_A$ and $\theta_B$ and $\varphi_A$ and $\varphi_B$ are the spherical coordinates of the
vectors $\mathbf{A}$ and $\mathbf{B}$, with $\theta$ their polar angle and $\varphi$ their azimuth.


A.18 Derive a formula equivalent to the one in Problem A.17 for the
angle between two vectors in a four-dimensional Euclidean space,
with polar angles $\theta$, $\psi$, and $\varphi$ for each vector, that is, $A_4 = A\cos\theta$,
$A_3 = A\sin\theta\cos\psi$, $A_2 = A\sin\theta\sin\psi\cos\varphi$, and
$A_1 = A\sin\theta\sin\psi\sin\varphi$.


A.19 From Gauss' theorem, (A.6.6), prove

(a) $\int_V d\mathbf{r}\;\nabla\phi = \int_S d\mathbf{S}\;\phi$

(b) $\int_V d\mathbf{r}\;\nabla\times\mathbf{A} = \int_S d\mathbf{S}\times\mathbf{A}$

for any function $\phi$ and vector $\mathbf{A}$.


A.20 From Stokes' theorem, show that if a vector $\mathbf{A}(\mathbf{x})$ has zero curl, then
$\mathbf{A}(\mathbf{x})$ is the gradient of a scalar $\psi(\mathbf{x})$, with $\psi(\mathbf{x})$ given by

$$\psi(\mathbf{x}) = \int_{\mathbf{x}_0}^{\mathbf{x}} d\boldsymbol{\ell}\cdot\mathbf{A}$$

independent of the path from $\mathbf{x}_0$ to $\mathbf{x}$.


A.21 Let $V$ be a volume bounded by a surface $S$. From Gauss' theorem,
show that

$$\int_V (\nabla\psi)^{2}\, d\mathbf{r} = \int_S \psi\nabla\psi\cdot d\mathbf{S} - \int_V \psi\nabla^{2}\psi\, d\mathbf{r}$$

from which prove that a vector whose divergence and curl are both
zero, and which vanishes sufficiently rapidly at $\infty$, must be zero
everywhere. Give the criteria for "sufficiently rapidly."


A.22 From Gauss’ theorem applied to a small cube, derive the expression 
(A.6.9). 
APPENDIX B


Spherical Harmonics and 
Orthogonal Polynomials 


B.1. LEGENDRE POLYNOMIALS 


The simple potential function

$$\phi = \frac{1}{|\mathbf{x} - \mathbf{x}'|} = \frac{1}{\left[(\mathbf{x} - \mathbf{x}')^{2}\right]^{1/2}} \tag{B.1.1}$$

can be expanded for small $r'/r$ in a power series in $r'/r$, and for small $r/r'$,
in a power series in that variable. In order to avoid confusion with the x
component of $\mathbf{x}$, we here denote the magnitude of $\mathbf{x}$ by $r$:

$$r = (\mathbf{x}^{2})^{1/2}. \tag{B.1.2}$$


We can test for the radius of convergence of the series by finding the
zeros of the function

$$D = (\mathbf{x}-\mathbf{x}')^2 = r^2 + r'^2 - 2rr'\cos\theta, \tag{B.1.3}$$

where $\theta$ is the angle between the vectors $\mathbf{x}$ and $\mathbf{x}'$.
If we let $r'/r = u$, the equation $D = 0$ becomes

$$1 + u^2 - 2u\cos\theta = 0 \tag{B.1.4}$$

whose solutions are

$$u = \cos\theta \pm \sqrt{\cos^2\theta - 1} = e^{\pm i\theta}. \tag{B.1.5}$$
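As a quick numerical illustration of (B.1.5), the following Python sketch evaluates the two roots of (B.1.4) so that one can confirm both lie on the unit circle $|u| = 1$. The function name `zeros_of_D` is introduced here for illustration only and does not appear in the text.

```python
import cmath
import math


def zeros_of_D(theta):
    """Roots of 1 + u**2 - 2*u*cos(theta) = 0, cf. (B.1.4) and (B.1.5)."""
    c = math.cos(theta)
    root = cmath.sqrt(c * c - 1.0)  # purely imaginary for real theta
    return c + root, c - root


if __name__ == "__main__":
    for theta in (0.3, 1.0, 2.5):
        u_plus, u_minus = zeros_of_D(theta)
        # Both moduli should equal 1 up to rounding.
        print(theta, abs(u_plus), abs(u_minus))
```

Since the only singularities of the function in the $u$ plane sit at $|u| = 1$, a power series about $u = 0$ can be expected to converge for $|u| < 1$.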


The function $\phi$, considered a function of the complex variable $u$, is
therefore analytic in $u$ for $|u| < 1$, so that the power series in $u$ is convergent
for $|u| < 1$. Thus, we may write


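$$\frac{1}{(r^2 + r'^2 - 2rr'\cos\theta)^{1/2}} = \frac{1}{r'}\sum_{\ell=0}^{\infty}\left(\frac{r}{r'}\right)^{\ell} P_\ell(\cos\theta) \tag{B.1.6}$$

for $r < r'$, and

$$\frac{1}{(r^2 + r'^2 - 2rr'\cos\theta)^{1/2}} = \frac{1}{r}\sum_{\ell=0}^{\infty}\left(\frac{r'}{r}\right)^{\ell} P_\ell(\cos\theta) \tag{B.1.7}$$

for $r > r'$.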

The function $P_\ell(\cos\theta)$ defined by (B.1.6) and (B.1.7) is clearly a
polynomial in $\cos\theta$ of degree $\ell$, even or odd in $\cos\theta$ according to whether
$\ell$ is even or odd. It is called a Legendre polynomial. It is also clear that
the function $r^\ell P_\ell(\cos\theta)$ is a polynomial of degree $\ell$ in the components $x$,
$y$, $z$ of the vector $\mathbf{x}$, with $\mathbf{x}'$ held fixed. In particular, if we choose $\mathbf{x}'$
as the polar axis of the $\mathbf{x}$ coordinate system, then $r\cos\theta = z$ and $r^2 =
x^2 + y^2 + z^2$, so that $r^\ell P_\ell(\cos\theta)$ is a polynomial function only of $z$ and
$r^2$. The function $\phi$ with the property (B.1.6) is called a generating function
for the $P_\ell$'s.
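The expansion (B.1.6) can be checked numerically. The sketch below is an illustration only: the helper names are ours, and `legendre_p` evaluates $P_\ell$ with the three-term recurrence quoted in Problem B.4 rather than from the generating function itself. It compares a truncated series with the closed form for one choice of $r < r'$.

```python
import math


def legendre_p(l, w):
    """P_l(w) from (k+1) P_{k+1} = (2k+1) w P_k - k P_{k-1}, cf. Problem B.4."""
    p_prev, p = 1.0, w
    if l == 0:
        return p_prev
    for k in range(1, l):
        p_prev, p = p, ((2 * k + 1) * w * p - k * p_prev) / (k + 1)
    return p


def inverse_distance_series(r, r_prime, theta, lmax=60):
    """Truncated right-hand side of (B.1.6); assumes r < r_prime."""
    w = math.cos(theta)
    u = r / r_prime
    return sum(u**l * legendre_p(l, w) for l in range(lmax + 1)) / r_prime


def inverse_distance_exact(r, r_prime, theta):
    """Left-hand side of (B.1.6)."""
    return 1.0 / math.sqrt(r * r + r_prime * r_prime - 2.0 * r * r_prime * math.cos(theta))


if __name__ == "__main__":
    print(inverse_distance_series(1.0, 2.0, 0.7))
    print(inverse_distance_exact(1.0, 2.0, 0.7))
```

The truncation order `lmax` is an arbitrary choice; the neglected terms are bounded by a geometric tail in $r/r'$, so convergence slows as $r$ approaches $r'$.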

We study some properties of the $P_\ell$'s:


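1. Since

$$\nabla^2\frac{1}{|\mathbf{x}-\mathbf{x}'|} = 0 \quad (\mathbf{x}\neq\mathbf{x}') \qquad\text{and}\qquad \nabla'^2\frac{1}{|\mathbf{x}-\mathbf{x}'|} = 0 \quad (\mathbf{x}\neq\mathbf{x}'), \tag{B.1.8}$$

(B.1.6) shows that the polynomial function of $\mathbf{x}$, $r^\ell P_\ell(\cos\theta)$, is
harmonic; that is, it satisfies the equation

$$\nabla^2\, r^\ell P_\ell(\cos\theta) = 0. \tag{B.1.9}$$

Similarly,

$$\nabla^2\,\frac{1}{r^{\ell+1}} P_\ell(\cos\theta) = 0. \tag{B.1.10}$$

2. In the forward and backward directions, the expansion (B.1.6)
becomes

$$\frac{1}{|r \mp r'|} = \frac{1}{r'}\sum_{\ell=0}^{\infty}(\pm 1)^{\ell}\left(\frac{r}{r'}\right)^{\ell} \tag{B.1.11}$$

so that

$$P_\ell(\pm 1) = (\pm 1)^{\ell}. \tag{B.1.12}$$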
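3. Harmonic polynomials of different order are orthogonal. That is,
if

$$\nabla^2 p_\ell = \nabla^2 q_{\ell'} = 0 \tag{B.1.13}$$

where $p_\ell$ and $q_{\ell'}$ are polynomials of order $\ell$ and $\ell'$ in $x$, $y$, $z$, then
the integral over solid angle, $d\Omega$,

$$\int p_\ell\, q_{\ell'}\, d\Omega = 0, \qquad \ell \neq \ell'. \tag{B.1.14}$$

Proof: Integrate over a spherical volume:

$$\int (p_\ell \nabla^2 q_{\ell'} - q_{\ell'} \nabla^2 p_\ell)\, d\mathbf{r} = 0 \tag{B.1.15}$$

so that

$$\int_S (p_\ell \nabla q_{\ell'} - q_{\ell'} \nabla p_\ell)\cdot d\mathbf{S} = 0 \tag{B.1.16}$$

integrated over the spherical surface bounding the volume. Since
$d\mathbf{S}\cdot\nabla = r^2\, d\Omega\,(\partial/\partial r)$, (B.1.16) becomes, with

$$p_\ell = r^{\ell}\, Y_\ell(\Omega) \qquad\text{and}\qquad q_{\ell'} = r^{\ell'} Z_{\ell'}(\Omega), \tag{B.1.17}$$

$$0 = r^{\ell+\ell'+1}(\ell' - \ell)\int Y_\ell(\Omega)\, Z_{\ell'}(\Omega)\, d\Omega \tag{B.1.18}$$

and the result is proved. In particular, Legendre polynomials $P_\ell(w)$
and $P_{\ell'}(w)$ with $\ell \neq \ell'$ are orthogonal when integrated over $w$.
Here, $w = \cos\theta$, $dw = \sin\theta\, d\theta$, and $d\Omega = d\varphi\, dw$, where $\varphi$ goes
from $0$ to $2\pi$ and $w$ now goes from $-1$ to $1$.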


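4. The normalization integral for $P_\ell$ can be found by integrating
$1/|\mathbf{r}-\mathbf{r}'|^2$:

$$\int \frac{d\Omega}{|\mathbf{r}-\mathbf{r}'|^2} = 2\pi\int_{-1}^{1}\frac{dw}{r^2 + r'^2 - 2rr'w} = \frac{2\pi}{2rr'}\log\frac{(r+r')^2}{(r'-r)^2} = \frac{2\pi}{rr'}\left[\log(r'+r) - \log(r'-r)\right] \tag{B.1.19}$$

for $r' > r$. Here, the symbol $\log x$ stands for the natural logarithm
of $x$. Expanding in powers of $r/r'$, we have

$$\int \frac{d\Omega}{|\mathbf{r}-\mathbf{r}'|^2} = \frac{2\pi}{rr'}\left[\log\left(1 + \frac{r}{r'}\right) - \log\left(1 - \frac{r}{r'}\right)\right] = \frac{4\pi}{rr'}\left[\frac{r}{r'} + \frac{1}{3}\left(\frac{r}{r'}\right)^3 + \cdots\right] = \frac{4\pi}{r'^2}\left[1 + \frac{1}{3}\left(\frac{r}{r'}\right)^2 + \frac{1}{5}\left(\frac{r}{r'}\right)^4 + \cdots\right]. \tag{B.1.20}$$

On the other hand,

$$\int \frac{d\Omega}{|\mathbf{r}-\mathbf{r}'|^2} = \frac{1}{r'^2}\int d\Omega \sum_{\ell=0}^{\infty}\left(\frac{r}{r'}\right)^{\ell} P_\ell(w)\sum_{\ell'=0}^{\infty}\left(\frac{r}{r'}\right)^{\ell'} P_{\ell'}(w) = \frac{2\pi}{r'^2}\sum_{\ell=0}^{\infty}\left(\frac{r}{r'}\right)^{2\ell}\int_{-1}^{1} dw\, P_\ell(w) P_\ell(w) \tag{B.1.21}$$

so that, in all,

$$\int_{-1}^{1} P_\ell(w)\, P_{\ell'}(w)\, dw = \frac{2}{2\ell+1}\,\delta_{\ell\ell'}. \tag{B.1.22}$$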


5. The $P_\ell$'s are orthogonal polynomials of parity $(-1)^\ell$ and of order
$\ell$ in $w$, with a weight function 1 on the interval $w$ between $\pm 1$ and
a normalization $P_\ell(1) = 1$. Evidently, the $P_\ell$'s can be sequentially
constructed from these rules. Thus,

$$P_0(w) = 1 \tag{B.1.23}$$

$$P_1(w) = w \tag{B.1.24}$$

$$P_2(w) = \alpha + \beta w^2, \tag{B.1.25}$$

etc.
$P_2$ is automatically orthogonal to $P_1$; orthogonality to $P_0$
requires

$$\int_{-1}^{1} dw\,(\alpha + \beta w^2) = 0$$

or

$$\alpha = -\frac{\beta}{3}. \tag{B.1.26}$$

Together with $P_2(1) = 1$, or $\alpha + \beta = 1$, (B.1.26) determines $P_2$ as

$$P_2(w) = \frac{3}{2}w^2 - \frac{1}{2}. \tag{B.1.27}$$

This procedure can be continued to construct any finite-order Legendre
polynomial.
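The sequential construction described in item 5 can be sketched in a few lines of Python. This is a minimal illustration under our own naming (`poly_mul`, `inner`, `legendre_polynomials` are not from the text): each polynomial is stored as a list of exact rational coefficients in ascending powers of $w$, orthogonality to the lower $P$'s is enforced by Gram-Schmidt with unit weight on $[-1, 1]$, and the result is rescaled so that $P_\ell(1) = 1$.

```python
from fractions import Fraction


def poly_mul(p, q):
    """Product of two polynomials given as ascending coefficient lists."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def inner(p, q):
    """Integral of p(w) q(w) dw over -1..1; odd powers integrate to zero."""
    return sum(2 * c / (n + 1) for n, c in enumerate(poly_mul(p, q)) if n % 2 == 0)


def legendre_polynomials(lmax):
    """Return [P_0, ..., P_lmax] built by orthogonalization and P_l(1) = 1."""
    polys = []
    for l in range(lmax + 1):
        p = [Fraction(0)] * l + [Fraction(1)]  # trial polynomial w**l
        for q in polys:  # remove the components along lower-order P's
            coeff = inner(p, q) / inner(q, q)
            for n, c in enumerate(q):
                p[n] -= coeff * c
        value_at_one = sum(p)  # p(1) equals the sum of the coefficients
        polys.append([c / value_at_one for c in p])
    return polys


if __name__ == "__main__":
    for l, p in enumerate(legendre_polynomials(4)):
        print(l, [str(c) for c in p], "norm:", inner(p, p))
```

For $\ell = 2$ this reproduces the coefficients of (B.1.27), and the printed norms can be compared with the value $2/(2\ell+1)$ of (B.1.22).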


B.2. SPHERICAL HARMONICS 


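We return to (B.1.1) and expand directly in powers of $x$, $y$, and $z$ using
the three-dimensional Taylor's theorem. That is, for $r < r'$,

$$\frac{1}{|\mathbf{x}-\mathbf{x}'|} = \sum_{\ell=0}^{\infty}\frac{(-1)^{\ell}}{\ell!}\,(\mathbf{x}\cdot\nabla')^{\ell}\,\frac{1}{r'} \tag{B.2.1}$$

$$\phi(\mathbf{x}-\mathbf{x}') = \sum_{\ell=0}^{\infty}\frac{(-1)^{\ell}}{\ell!}\, x_{i_1}\cdots x_{i_\ell}\,\phi^{(\ell)}_{i_1\cdots i_\ell}(\mathbf{x}') \tag{B.2.2}$$

$$\phi^{(\ell)}_{i_1\cdots i_\ell}(\mathbf{x}') = \frac{\partial}{\partial x'_{i_1}}\cdots\frac{\partial}{\partial x'_{i_\ell}}\,\frac{1}{r'}. \tag{B.2.3}$$

$\phi^{(\ell)}(\mathbf{x}')$ is a symmetric tensor of rank $\ell$. It is also traceless, since
contracting on any pair of indices in (B.2.3) produces a $\nabla'^2$, which in turn
gives zero acting on $1/r'$.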

At a given point in space, we can count the number of independent
components of $\phi^{(\ell)}$ [i.e., the number of independent numbers we must
specify to determine all the components of $\phi^{(\ell)}$]. That number is $2\ell + 1$,
as we now show.

We count first the number $S(\ell, N)$ of components of an $\ell$th-rank
symmetric tensor in $N$ dimensions. We call the number of $x_1$ components
$n_1$, of $x_2$ components $n_2$, etc. The number of independent components is
then


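$$S(\ell, N) = \sum_{n_1}\cdots\sum_{n_N} 1 \qquad (n_1 + \cdots + n_N = \ell). \tag{B.2.4}$$

The sum can be deduced from

$$S(z, N) \equiv \sum_{\ell=0}^{\infty} z^{\ell}\, S(\ell, N) = \sum_{n_1=0}^{\infty}\cdots\sum_{n_N=0}^{\infty} z^{\,n_1 + \cdots + n_N} = \left(\frac{1}{1-z}\right)^{N} \tag{B.2.5}$$

which is convergent for $|z| < 1$. We can recover the $\ell$th term in the series
by integrating the complex function $S(z, N)/(2\pi i\, z^{\ell+1})$ around the origin:

$$S(\ell, N) = \oint \frac{dz}{2\pi i\, z^{\ell+1}}\, S(z, N) \tag{B.2.6}$$

$$= \frac{1}{\ell!}\left.\frac{d^{\ell}}{dz^{\ell}}\left(\frac{1}{1-z}\right)^{N}\right|_{z=0} = \frac{N(N+1)\cdots(N+\ell-1)}{\ell!} = \frac{(N+\ell-1)!}{(N-1)!\,\ell!} \tag{B.2.7}$$

which is the desired formula. For $N = 3$, we have

$$S(\ell, 3) = \frac{(\ell+2)!}{2!\,\ell!} = \frac{(\ell+1)(\ell+2)}{2}. \tag{B.2.8}$$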


The number $S_0(\ell, 3)$ of independent components of a traceless symmetric
tensor in three dimensions is

$$S_0(\ell, 3) = S(\ell, 3) - S(\ell - 2, 3) = 2\ell + 1 \tag{B.2.9}$$

since the symmetry of the tensor assures that all traces are identical and since
the trace of an $\ell$th-rank symmetric tensor is an $(\ell - 2)$th-rank symmetric
tensor.
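The counting in (B.2.7) through (B.2.9) can be confirmed by brute force. The following sketch (function names are ours) enumerates the independent index sets of a symmetric tensor directly, compares the count with the closed form, and then subtracts the number of trace conditions.

```python
from itertools import combinations_with_replacement
from math import comb


def count_symmetric(rank, dim):
    """Count independent components by listing unordered index sets."""
    return sum(1 for _ in combinations_with_replacement(range(dim), rank))


def closed_form(rank, dim):
    """(N + l - 1)! / ((N - 1)! l!) from (B.2.7), written as a binomial."""
    return comb(dim + rank - 1, rank)


def count_traceless_3d(rank):
    """S_0(l, 3) = S(l, 3) - S(l - 2, 3), cf. (B.2.9)."""
    lower = closed_form(rank - 2, 3) if rank >= 2 else 0
    return closed_form(rank, 3) - lower


if __name__ == "__main__":
    for l in range(6):
        print(l, count_symmetric(l, 3), closed_form(l, 3), count_traceless_3d(l))
```

Each unordered index set corresponds to one choice of the occupation numbers $n_1, \ldots, n_N$ in (B.2.4), which is why the enumeration and the binomial agree.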

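We return to (B.2.2), which we may write as

$$\phi(\mathbf{x}-\mathbf{x}') = \sum_{\ell=0}^{\infty}\frac{(-1)^{\ell}}{\ell!}\, x_{i_1}\cdots x_{i_\ell}\,\phi^{(\ell)}_{i_1\cdots i_\ell}(\mathbf{x}') = \sum_{\ell=0}^{\infty}\frac{(-1)^{\ell}}{\ell!}\, P^{(\ell)}_{i_1\cdots i_\ell}(\mathbf{x})\,\phi^{(\ell)}_{i_1\cdots i_\ell}(\mathbf{x}') \tag{B.2.10}$$

where

$$P^{(\ell)}_{i_1\cdots i_\ell} = x_{i_1}\cdots x_{i_\ell} \tag{B.2.11}$$

is a symmetric $\ell$th-rank tensor.

Since there are only $2\ell + 1$ independent $\phi^{(\ell)}$'s, there must be only
$2\ell + 1$ independent components of the $P^{(\ell)}$'s that matter. We exhibit this,
making $P^{(\ell)}$ traceless by subtracting Kronecker deltas. These have no effect
on $\phi$, since

$$\delta_{i_1 i_2}\,\phi^{(\ell)}_{i_1 i_2\cdots i_\ell} = 0. \tag{B.2.12}$$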


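We illustrate the subtraction procedure for $\ell = 2$. Define

$$P^{(2)}_{0,\,ij} = P^{(2)}_{ij} - a\,\delta_{ij} \tag{B.2.13}$$

such that $P^{(2)}_{0,\,ii} = 0$; that is,

$$3a = P^{(2)}_{ii}$$

and

$$P^{(2)}_{0,\,ij} = P^{(2)}_{ij} - \tfrac{1}{3}\,\delta_{ij}\,P^{(2)}_{kk} = x_i x_j - \tfrac{1}{3}\,\delta_{ij}\,x^2. \tag{B.2.14}$$

The above procedure, applied to $P^{(\ell)}$ of arbitrary rank, produces an $\ell$th-rank,
symmetric, traceless tensor, $P^{(\ell)}_{0,\,i_1\cdots i_\ell}(\mathbf{x})$, with $2\ell + 1$ independent
components.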
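For $\ell = 2$ the result (B.2.14) is easy to test numerically. The sketch below (names and the finite-difference step `h` are our own choices) builds $x_i x_j - \tfrac{1}{3}\delta_{ij}x^2$ at a sample point and checks that its trace vanishes and that a central-difference Laplacian of every component is zero to rounding accuracy, as expected for a harmonic quadratic polynomial.

```python
def traceless_rank2(x):
    """Components x_i x_j - delta_ij x^2 / 3 of (B.2.14) at the point x."""
    r2 = sum(c * c for c in x)
    return [[x[i] * x[j] - (r2 / 3.0 if i == j else 0.0) for j in range(3)]
            for i in range(3)]


def laplacian(f, x, h=1e-3):
    """Central-difference Laplacian of a scalar function f at the point x."""
    total = 0.0
    for axis in range(3):
        plus, minus = list(x), list(x)
        plus[axis] += h
        minus[axis] -= h
        total += (f(plus) - 2.0 * f(x) + f(minus)) / (h * h)
    return total


def component_laplacian(i, j, x):
    """Laplacian of the (i, j) component of the traceless tensor."""
    return laplacian(lambda y: traceless_rank2(y)[i][j], x)


if __name__ == "__main__":
    point = [0.3, -1.2, 0.7]
    tensor = traceless_rank2(point)
    print("trace:", tensor[0][0] + tensor[1][1] + tensor[2][2])
    print("laplacians:", [[component_laplacian(i, j, point) for j in range(3)]
                          for i in range(3)])
```

The central difference is exact for quadratics apart from rounding, so no truncation error enters this particular check.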

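Since $\nabla^2\, 1/|\mathbf{x}-\mathbf{x}'| = 0$ for $\mathbf{x} \neq \mathbf{x}'$, the polynomial product
$P^{(\ell)}_{0,\,i_1\cdots i_\ell}(\mathbf{x})\,\phi^{(\ell)}_{i_1\cdots i_\ell}(\mathbf{x}')$ must satisfy Laplace's equation; since there are
$2\ell + 1$ independent $\phi^{(\ell)}$'s and $2\ell + 1$ independent $P^{(\ell)}_0$'s, each $P^{(\ell)}_0$ must
satisfy Laplace's equation, that is,

$$\nabla^2 P^{(\ell)}_{0,\,i_1\cdots i_\ell} = 0. \tag{B.2.15}$$

We have thus shown that there exist at least $2\ell + 1$ harmonic polynomials
of degree $\ell$: the $P^{(\ell)}_{0,\,i_1\cdots i_\ell}$'s. And we may write

$$\phi = \sum_{\ell=0}^{\infty}\frac{(-1)^{\ell}}{\ell!}\, P^{(\ell)}_{0,\,i_1\cdots i_\ell}(\mathbf{x})\,\phi^{(\ell)}_{i_1\cdots i_\ell}(\mathbf{x}'). \tag{B.2.16}$$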


We show next that there are exactly $2\ell + 1$ independent harmonic
polynomials of degree $\ell$. We proceed by writing an arbitrary polynomial
of degree $\ell$ as

$$\psi^{(\ell)} = \sum_{m_1 + m_2 + m_3 = \ell} (x + iy)^{m_1}(x - iy)^{m_2} z^{m_3}\, a(m_1, m_2, m_3) \tag{B.2.17}$$

where the coefficient $a(m_1, m_2, m_3)$ is to be determined.
Suppose $m_1 > m_2$; we then write

$$(x + iy)^{m_1}(x - iy)^{m_2} = (x + iy)^{m_1 - m_2}(x^2 + y^2)^{m_2}; \tag{B.2.18}$$

if $m_1 < m_2$, we write

$$(x + iy)^{m_1}(x - iy)^{m_2} = (x - iy)^{m_2 - m_1}(x^2 + y^2)^{m_1}. \tag{B.2.19}$$

We can expand

$$\psi^{(\ell)} = \sum_{m}\psi^{(\ell, m)} \tag{B.2.20}$$

where, for fixed $m = m_1 - m_2 > 0$,

$$\psi^{(\ell, m)} = (x + iy)^{m}\sum_{2m_2 + m_3 = \ell - m}(x^2 + y^2)^{m_2} z^{m_3}\, a(m_1, m_2, m_3). \tag{B.2.21}$$


Note that $m$ may take on values from 0 to $\ell$. The $m_1$ and $m_2$ values with
$m_2 > m_1$ add $\ell$ functions proportional to $(x - iy)^{|m|}$, so that in all we have
functions $\psi^{(\ell, m)}$ whose azimuthal $\varphi$ dependence (in spherical coordinates)
is $e^{im\varphi}$, with $m$ taking on $2\ell + 1$ values between $\pm\ell$. Note that functions
with different $m$ values are orthogonal:

$$\int \psi^{(\ell, m)*}\,\psi^{(\ell', m')}\, d\Omega = 0, \qquad m \neq m'. \tag{B.2.22}$$

We already know from (B.1.14) that $\psi$'s of different $\ell$ are orthogonal.

Finally, we show that orthogonality between different $\ell$ values
determines $\psi^{(\ell, m)}$ to within a constant multiple; normalization determines
it to within a phase.
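We first note from (B.2.17) that

$$\psi^{(\ell)}(-\mathbf{x}) = (-1)^{m_1 + m_2 + m_3}\,\psi^{(\ell)}(\mathbf{x}) = (-1)^{\ell}\,\psi^{(\ell)}(\mathbf{x}). \tag{B.2.23}$$

The transformation $\mathbf{x} \to -\mathbf{x}$ in spherical coordinates is given by

$$\varphi \to \varphi + \pi, \qquad \theta \to \pi - \theta \tag{B.2.24}$$

so that, since $e^{im\varphi} \to (-1)^m e^{im\varphi}$ under the transformation (B.2.24), the
residual function in (B.2.21)

$$\sum (x^2 + y^2)^{m_2} z^{m_3}\, a(m_1, m_2, m_3), \tag{B.2.25}$$

must be multiplied by $(-1)^{\ell - m}$ under the transformation $z \to -z$.
Factoring the $r^\ell$ dependence of $\psi^{(\ell, m)}$, we define

$$\psi^{(\ell, m)} = r^{\ell}\sin^{|m|}\theta\; e^{im\varphi}\,\bar{P}_{\ell m}(\cos\theta) \tag{B.2.26}$$

where $\bar{P}_{\ell m}(\cos\theta)$ is a polynomial of degree $\ell - |m|$ in $\cos\theta$.¹ Under the
transformation $\mathbf{r} \to -\mathbf{r}$, then,

$$\bar{P}_{\ell m}(-\cos\theta) = (-1)^{\ell - m}\,\bar{P}_{\ell m}(\cos\theta).$$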


The functions (B.2.26) are already orthogonal for different $m$ values.
If they are to be harmonic, they must also be orthogonal for different $\ell$
values. That is,

$$\int \psi^{(\ell, m)*}\,\psi^{(\ell', m')}\, d\Omega = 0, \qquad \ell \neq \ell' \ \text{or}\ m \neq m'. \tag{B.2.27}$$

Following (B.2.27), we construct, for each $m$, a sequence of orthogonal
polynomials of degrees $\ell - |m|$ in $z$, starting with $\ell = |m|$ and
containing only even or odd powers of $z$ according to the parity $(-1)^{\ell - m}$.
The weight function for the sequence is $(\sin^{|m|}\theta)^2 = (1 - \cos^2\theta)^{|m|}$.


¹We introduce the bar in the symbol $\bar{P}_{\ell m}$ because there exists a conventionally defined
symbol $P_{\ell m}$ with a different normalization.
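TABLE B.1.

$$\begin{array}{c|lllll}
m \backslash \ell & 0 & 1 & 2 & 3 & 4 \\ \hline
4 & & & & & \sin^4\theta\, e^{4i\varphi} \\
3 & & & & \sin^3\theta\, e^{3i\varphi} & \sin^3\theta\, e^{3i\varphi}\cos\theta \\
2 & & & \sin^2\theta\, e^{2i\varphi} & \sin^2\theta\, e^{2i\varphi}\cos\theta & \sin^2\theta\, e^{2i\varphi}(g\cos^2\theta + h) \\
1 & & \sin\theta\, e^{i\varphi} & \sin\theta\, e^{i\varphi}\cos\theta & \sin\theta\, e^{i\varphi}(c\cos^2\theta + d) & \sin\theta\, e^{i\varphi}\cos\theta\,(j\cos^2\theta + k) \\
0 & 1 & \cos\theta & a\cos^2\theta + b & \cos\theta\,(e\cos^2\theta + f) & \vdots \\
-1 & & \sin\theta\, e^{-i\varphi} & \sin\theta\, e^{-i\varphi}\cos\theta & & \\
-2 & & & \sin^2\theta\, e^{-2i\varphi} & &
\end{array}$$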


This is illustrated in Table B.1. 

Evidently, there are just enough orthogonality relations to determine 
all the coefficients to within a constant multiple. If we normalize these 
functions to one, they are determined up to a constant phase. For some 
purposes, it is convenient to choose 


$$\bar{P}_{\ell m} = (-1)^{m}\,\bar{P}_{\ell, -m} \tag{B.2.28}$$


and we shall generally do so. 

We have now constructed for each $m$ a unique orthonormal set of
polynomials of degree $\ell$; since $m$ ranges from $-\ell$ to $\ell$, there are, for each
$\ell$, $2\ell + 1$ polynomials. Harmonic polynomials may be chosen to have a
fixed $m$, since the Laplacian operator $\nabla^2$ does not mix values of $m$, that
is,

$$\nabla^2\, e^{im\varphi}F(\theta) = e^{im\varphi}G(\theta). \tag{B.2.29}$$

In addition, harmonic polynomials must be orthogonal for different
$\ell$, a property by which they are uniquely determined, as we have just
shown. Therefore, our orthogonality procedure has determined the $2\ell + 1$
harmonic polynomials of each order.

The notation we use for the normalized functions is

$$Y_{\ell m}(\theta, \varphi) = \sin^{|m|}\theta\; e^{im\varphi}\,\bar{P}_{\ell m}(w). \tag{B.2.30}$$


B.3. COMPLETENESS OF THE $Y_{\ell m}$


We show first that the Legendre polynomials are complete, that is, that

$$\sum_{\ell}(2\ell + 1)\int P_\ell(\hat{\mathbf{r}}\cdot\hat{\mathbf{r}}')\, f(\mathbf{r}')\, d\Omega' = 4\pi f(\mathbf{r}) \tag{B.3.1}$$

for $f(\mathbf{r})$ a sufficiently well-behaved function of angle, and where the
vectors $\mathbf{r}'$ and $\mathbf{r}$ have the same length: $|\mathbf{r}'| = |\mathbf{r}| = r$.
We proceed by considering the integral

$$F = -\int d\mathbf{S}'\cdot\left[\nabla'\left(\frac{1}{|\mathbf{r}'-\mathbf{r}_i|} - \frac{1}{|\mathbf{r}'-\mathbf{r}_o|}\right)\right] f(\mathbf{r}') \tag{B.3.2}$$

where the surface $S'$ is a sphere of radius $r$. The vector $\mathbf{r}_i$ is inside the
sphere, $\mathbf{r}_o$ outside the sphere; for simplicity we take them to be colinear.
We will take the limit $\mathbf{r}_i \to \mathbf{r}$ and $\mathbf{r}_o \to \mathbf{r}$ in two different ways.

Except for the singularities in the integrand (B.3.2) at $\mathbf{r}' = \mathbf{r}_i$ and $\mathbf{r}' =
\mathbf{r}_o$, the limit would be zero. We take the singularity properly into account
by expanding $f(\mathbf{r}')$ about $\mathbf{r}_i$ in the first term of (B.3.2) [as $f(\mathbf{r}') =
f(\mathbf{r}_i) + (\mathbf{r}' - \mathbf{r}_i)\cdot\nabla f + \cdots$] and about $\mathbf{r}_o$ in the second. The terms with
$\mathbf{r}' - \mathbf{r}_i$ and $\mathbf{r}' - \mathbf{r}_o$ to the first and higher powers remove the singularity
and, hence, in the limit can be dropped. There remains


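$$F \cong -\int d\mathbf{S}'\cdot\nabla'\left[\frac{f(\mathbf{r}_i)}{|\mathbf{r}'-\mathbf{r}_i|} - \frac{f(\mathbf{r}_o)}{|\mathbf{r}'-\mathbf{r}_o|}\right]$$

+ terms that go to zero, so that

$$\lim_{\substack{\mathbf{r}_i \to \mathbf{r} \\ \mathbf{r}_o \to \mathbf{r}}} F = 4\pi f(\mathbf{r}). \tag{B.3.3}$$

On the other hand, expanding $1/|\mathbf{r}'-\mathbf{r}_i|$ in powers of $r_i/r'$ and
$1/|\mathbf{r}'-\mathbf{r}_o|$ in powers of $r'/r_o$, we have, from (B.3.2),

$$F = -r^2\sum_{\ell}\int d\Omega'\left[\frac{\partial}{\partial r'}\left(\frac{r_i^{\ell}}{r'^{\ell+1}} - \frac{r'^{\ell}}{r_o^{\ell+1}}\right)\right]_{r' = r} P_\ell(\hat{\mathbf{r}}\cdot\hat{\mathbf{r}}')\, f(\mathbf{r}')$$

or

$$F = \sum_{\ell}\int d\Omega'\left[(\ell + 1)\left(\frac{r_i}{r}\right)^{\ell} + \ell\left(\frac{r}{r_o}\right)^{\ell+1}\right] P_\ell(\hat{\mathbf{r}}\cdot\hat{\mathbf{r}}')\, f(\mathbf{r}'). \tag{B.3.4}$$

It is now safe to take the limit, which yields, as claimed,

$$4\pi f(\mathbf{r}) = \sum_{\ell}(2\ell + 1)\int P_\ell(\hat{\mathbf{r}}\cdot\hat{\mathbf{r}}')\, f(\mathbf{r}')\, d\Omega'. \tag{B.3.1}$$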
g 


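Next, we note that since $r^{\ell}P_\ell(\hat{\mathbf{r}}\cdot\hat{\mathbf{r}}')$ is a harmonic polynomial of order
$\ell$, as is $r'^{\ell}P_\ell(\hat{\mathbf{r}}\cdot\hat{\mathbf{r}}')$, we must be able to expand $P_\ell(\hat{\mathbf{r}}\cdot\hat{\mathbf{r}}')$ in terms of the
$Y_{\ell m}(\Omega)$'s and $Y_{\ell m}(\Omega')$'s, that is,

$$P_\ell(\hat{\mathbf{r}}\cdot\hat{\mathbf{r}}') = \sum_{m, m'} a_{mm'}\, Y_{\ell m}(\theta, \varphi)\, Y^*_{\ell m'}(\theta', \varphi'). \tag{B.3.5}$$

Since $\hat{\mathbf{r}}\cdot\hat{\mathbf{r}}' = \cos\theta\cos\theta' + \sin\theta\sin\theta'\cos(\varphi - \varphi')$, (B.3.5) must depend
only on $\varphi - \varphi'$ (not on $\varphi + \varphi'$). Therefore, $a_{mm'}$ must equal zero unless
$m' = m$, and (B.3.5) becomes

$$P_\ell(\hat{\mathbf{r}}\cdot\hat{\mathbf{r}}') = \sum_{m} a_m\, Y_{\ell m}(\theta, \varphi)\, Y^*_{\ell m}(\theta', \varphi'). \tag{B.3.6}$$

Note that since $P_\ell$ is real and invariant under interchange of $\hat{\mathbf{r}}$ and $\hat{\mathbf{r}}'$, $a_m$
must be real. We perform two operations on (B.3.6):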


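1. Set $\hat{\mathbf{r}}' = \hat{\mathbf{r}}$ and integrate over $d\Omega$. We find

$$4\pi = \sum_{m} a_m. \tag{B.3.7}$$

2. Now square (B.3.6):

$$P_\ell^2(\hat{\mathbf{r}}\cdot\hat{\mathbf{r}}') = \sum_{m} a_m\, Y_{\ell m}(\Omega)\, Y^*_{\ell m}(\Omega') \times \sum_{m'} a^*_{m'}\, Y^*_{\ell m'}(\Omega)\, Y_{\ell m'}(\Omega'). \tag{B.3.8}$$

3. Integrate over $d\Omega$. There results

$$\frac{4\pi}{2\ell + 1} = \sum_{m} |a_m|^2\, |Y_{\ell m}(\Omega')|^2. \tag{B.3.9}$$

4. Finally, integrate over $d\Omega'$ to find

$$\frac{(4\pi)^2}{2\ell + 1} = \sum_{m} |a_m|^2. \tag{B.3.10}$$

The unique solution of (B.3.7) and (B.3.10) is $a_m = 4\pi/(2\ell + 1)$. (See
Problem B.11.) Our final result is therefore

$$P_\ell(\hat{\mathbf{r}}\cdot\hat{\mathbf{r}}') = \frac{4\pi}{2\ell + 1}\sum_{m=-\ell}^{\ell} Y_{\ell m}(\Omega)\, Y^*_{\ell m}(\Omega'). \tag{B.3.11}$$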
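The addition theorem (B.3.11) lends itself to a direct numerical check. The sketch below is illustrative only. It assumes the common convention $Y_{\ell m} = \sqrt{\frac{2\ell+1}{4\pi}\frac{(\ell-m)!}{(\ell+m)!}}\,P_\ell^m(\cos\theta)\,e^{im\varphi}$ with the Condon-Shortley phase, which may differ from the phase choice (B.2.28) of the text; the sum in (B.3.11) is insensitive to that choice because each term contains $Y_{\ell m}$ together with its complex conjugate. All function names are ours.

```python
import cmath
import math


def assoc_legendre(l, m, x):
    """P_l^m(x) for 0 <= m <= l by upward recurrence (Condon-Shortley phase assumed)."""
    pmm = 1.0
    if m > 0:
        somx2 = math.sqrt((1.0 - x) * (1.0 + x))
        fact = 1.0
        for _ in range(m):
            pmm *= -fact * somx2
            fact += 2.0
    if l == m:
        return pmm
    pmmp1 = x * (2 * m + 1) * pmm
    if l == m + 1:
        return pmmp1
    for ll in range(m + 2, l + 1):
        pll = (x * (2 * ll - 1) * pmmp1 - (ll + m - 1) * pmm) / (ll - m)
        pmm, pmmp1 = pmmp1, pll
    return pmmp1


def sph_harm(l, m, theta, phi):
    """Normalized Y_lm(theta, phi) in the assumed convention."""
    if m < 0:
        return (-1) ** (-m) * sph_harm(l, -m, theta, phi).conjugate()
    norm = math.sqrt((2 * l + 1) / (4.0 * math.pi)
                     * math.factorial(l - m) / math.factorial(l + m))
    return norm * assoc_legendre(l, m, math.cos(theta)) * cmath.exp(1j * m * phi)


def cos_angle_between(theta, phi, theta_p, phi_p):
    """Cosine of the angle between the two directions."""
    return (math.cos(theta) * math.cos(theta_p)
            + math.sin(theta) * math.sin(theta_p) * math.cos(phi - phi_p))


def addition_theorem_sum(l, theta, phi, theta_p, phi_p):
    """Right-hand side of (B.3.11)."""
    total = sum(sph_harm(l, m, theta, phi) * sph_harm(l, m, theta_p, phi_p).conjugate()
                for m in range(-l, l + 1))
    return 4.0 * math.pi / (2 * l + 1) * total


if __name__ == "__main__":
    angles = (0.4, 1.1, 2.0, -0.6)
    for l in range(5):
        lhs = assoc_legendre(l, 0, cos_angle_between(*angles))
        print(l, lhs, addition_theorem_sum(l, *angles))
```

The left-hand side is evaluated as $P_\ell^0 = P_\ell$, so both sides come from the same recurrence and differ only through the sum over $m$.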
The completeness theorem (B.3.1) for the $P_\ell$'s then tells us that any
function of angle can be expanded in the $Y_{\ell m}$'s:

$$f(\Omega) = \sum_{\ell}\frac{2\ell + 1}{4\pi}\int P_\ell(\hat{\mathbf{r}}\cdot\hat{\mathbf{r}}')\, f(\Omega')\, d\Omega'$$

or

$$f(\Omega) = \sum_{\ell, m} Y_{\ell m}(\Omega)\int Y^*_{\ell m}(\Omega')\, f(\Omega')\, d\Omega'. \tag{B.3.12}$$


The $Y_{\ell m}$'s are sometimes referred to as spherical tensors. Recall the
Cartesian form of harmonic polynomials generated by the coefficients of
$\phi^{(\ell)}$'s in (B.2.16):

$$P_{i_1\cdots i_\ell} = (x_{i_1}\cdots x_{i_\ell} - \text{traces}). \tag{B.3.13}$$

These transform under rotations as $\ell$th-rank, symmetric traceless tensors.

The $Y_{\ell m}$'s also have a simple transformation property, since a solution
of the Laplace equation must remain a solution under rotations. Therefore,

$$Y_{\ell m}(\Omega_R) = \sum_{m'} D^{(\ell)}_{mm'}(R)\, Y_{\ell m'}(\Omega) \tag{B.3.14}$$

where $\Omega_R$ takes the $\theta$, $\varphi$ of the original coordinate system ($\Omega$) and changes
them to the $\theta$, $\varphi$ of the same point with respect to the new coordinate
system. The expansion coefficients $D^{(\ell)}_{mm'}(R)$ define the transformation of
a spherical tensor.


APPENDIX B PROBLEMS 


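B.1 From the definition (B.1.6) of the $P_\ell$'s, show that $P'_\ell(w = 1) =
\ell(\ell + 1)/2$.


B.2 Again using (B.1.6), show that with $w = \cos\theta$,

$$P_\ell(w = 0) = 0 \quad\text{for } \ell \text{ odd}$$

$$P_\ell(w = 0) = (-1)^n\,\frac{\tfrac{1}{2}\cdot\tfrac{3}{2}\cdots\left(n - \tfrac{1}{2}\right)}{n!} \quad\text{for } \ell \text{ even}$$

where $n = \ell/2$, and the product $\tfrac{1}{2}\cdots\left(n - \tfrac{1}{2}\right) = 1$ for $n = 0$.


B.3 Again using (B.1.6), show that the $P_\ell$'s satisfy the differential
equation

$$-\frac{d}{dw}\,(1 - w^2)\,\frac{d}{dw}\,P_\ell(w) = \ell(\ell + 1)\,P_\ell(w).$$


B.4 Again using (B.1.6), show that

$$(\ell + 1)\,P_{\ell+1}(w) + \ell\,P_{\ell-1}(w) = w\,(2\ell + 1)\,P_\ell(w)$$

and from that, show that

$$\int_{-1}^{1} P_\ell\, w\, P_{\ell'}\, dw = 0$$

unless $\ell' = \ell \pm 1$ and that

$$\int_{-1}^{1} P_{\ell+1}(w)\, w\, P_\ell(w)\, dw = \frac{2(\ell + 1)}{(2\ell + 1)(2\ell + 3)}.$$


B.5 Use the orthonormality of the $Y_{\ell m}$'s to construct all the $Y_{\ell m}$'s for
$\ell \le 4$ as suggested by Table B.1.


B.6 Show that the function

$$P_\ell(\tau, \mathbf{r}) = \left(\tau(x + iy) - \frac{1}{\tau}(x - iy) - 2z\right)^{\ell}$$

for any $\tau$ is a harmonic polynomial of order $\ell$ and is a generating
function for the $Y_{\ell m}$'s:

$$P_\ell(\tau, \mathbf{r}) = r^{\ell}\sum_{m=-\ell}^{\ell}\tau^{m}\, c_{\ell m}\, Y_{\ell m}(\theta, \varphi)$$

where the $c_{\ell m}$'s are constant coefficients.


B.7 Show from the results of Problem B.6 that

$$\left(\frac{\partial}{\partial x} + i\frac{\partial}{\partial y}\right) r^{\ell}\, Y_{\ell m} = d_{\ell m}\, r^{\ell-1}\, Y_{\ell-1, m+1},$$

that

$$\left(\frac{\partial}{\partial x} - i\frac{\partial}{\partial y}\right) r^{\ell}\, Y_{\ell m} = d'_{\ell m}\, r^{\ell-1}\, Y_{\ell-1, m-1},$$

and that

$$\frac{\partial}{\partial z}\, r^{\ell}\, Y_{\ell m} = d''_{\ell m}\, r^{\ell-1}\, Y_{\ell-1, m}$$

where $d$, $d'$, and $d''$ are $\ell$, $m$-dependent constants.


B.8 The charge distribution on a spherical surface is given by

$$\sigma = \mathbf{A}\cdot\mathbf{r},$$

where $\mathbf{A}$ is a constant vector.
Find the potential

$$\phi(\mathbf{r}) = \int \frac{\sigma(\mathbf{r}')\, dS'}{|\mathbf{r}-\mathbf{r}'|}$$

and the field

$$\mathbf{E} = -\nabla\phi$$

inside and outside of the sphere.


B.9 Verify by direct integration Newton's theorem that the potential
outside a spherically symmetric charge distribution $\rho(r)$ is the same
as it would be were all the charge concentrated at the center. That
is, for $\mathbf{r}$ outside of the region in which $\rho$ is nonzero,

$$\int \frac{d\mathbf{r}'\,\rho(r')}{|\mathbf{r}-\mathbf{r}'|} = \frac{Q}{r}$$

where $Q = \int d\mathbf{r}'\,\rho(r')$.


B.10 We know from (B.2.26) that $r^{\ell}Y_{\ell m}$ is harmonic and of the form
$r^{\ell}e^{im\varphi}f_{\ell m}(\theta)$. Show from this that $f_{\ell m}$ satisfies the equation

$$\left[\frac{1}{\sin\theta}\frac{d}{d\theta}\sin\theta\frac{d}{d\theta} - \frac{m^2}{\sin^2\theta} + \ell(\ell + 1)\right] f_{\ell m} = 0.$$


B.11 Prove the statement made following (B.3.10), that (B.3.7) and
(B.3.10) imply $a_m = 4\pi/(2\ell + 1)$.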
    for interacting particles and gravitational field, 349-350
Affine connection, 319
Ampère's circuital law, 55
Analytic functions, use of, 40-44
Analyticity of dielectric constant in upper half plane, 91
Angular momentum:
    field, 121
    tensor, 277
    of vector multipoles, 227
Anisotropic media, propagation in, 114-118
Atomic systems:
    absorption by, 95
    Hamiltonian for, 96
    polarizability of, 95
    Schrödinger equation for, 96

Bessel functions, spherical, 199
Bianchi identity, 369
Biot and Savart, law of, 50-51
Boltzmann distribution, 63
Boltzmann equation, 130
Boltzmann-Vlasov equation, 131
Boundary conditions at material boundary:
    electromagnetic, 88
    electrostatic, 31
    magnetostatic, 62
Bremsstrahlung, low-frequency, 159-165
Brewster's angle, 113

Capacitance, 23
Cauchy principal value, 101
Cauchy-Riemann equations, 40
Cerenkov cone, 174
Cerenkov radiation, 170-176
Charge:
    bound, 26
    conservation of, 1
    density, 2, 132
    electrostatic unit of, 2
    free, 26
    line, 3, 41
    magnetic, 132
    surface, 3
Clausius-Mossotti relation, 45
Coefficients:
    of capacitance, 21
    of induction, 70-71
Compton wavelength, 182
Conductivity, 48
Conductors:
    electrostatics in the presence of, 16-20
    resistance of, 48
    systems of, 20-24
Conservation laws, relativistic, 271-277
Continuity equation, 47
Coulomb's law, 1-9
Covariance, general, 313-323
Covariant derivative, 318
Covariant electrodynamics, 260-269
Covariant equations for a point charge in electromagnetic field, 269
Covariant vector, 254, 257
Current:
    circuits, 70
    in conducting fluid, uniqueness of, 77
    continuous distribution of, 54
    density, 47
    displacement, 82
    steady, 47-50
    surface, 62
    time-dependent, 81-134
Curvature:
    defined, 367-368
    independent components of, 369
    symmetry properties of, 368-369
Cylinder, charged, 36
Cylindrical wave guide:
    TEM mode, 127
    TE/TM modes, 128

Delta function, 3
Density:
    charge, 2, 132
    current, 47
    energy, average, 105
    Lagrangian, 284-290, 307, 332-333, 345-349
    magnetic moment, 58
    scalar, Lorentz, 261
    tensor, Lorentz, 263
    vector, Lorentz, 262
Diamagnetism, 65, 73-77
Dielectric tensor:
    analyticity of, 90, 95
    anti-Hermitian part, 94, 101
    antisymmetric component of, 33
    defined, 32
    frequency dependence of, 90
Dipole:
    electric, 9
        of classical atom in magnetic field, 125
        moment per unit volume, 30
        quantum calculation of, 95
        radiation of, 146-150
    magnetic, 50
        radiation of, 148
Doppler shift, 269
Dual tensor, 266

Einstein field equations, 370-376
Electrodynamics, covariant, 260-269
Electromagnetic field:
    equations of motion for a point charge in, 269-271
    in matter, 84-91
    optical theorem for, 210-211
    of point charge moving at constant high velocity, 150
    propagation, 102
    scattering of, 208-240
    tensor, 264
Electromotive force:
    defined, 49
    motional, 66
Electrostatics:
    conductors:
        in the presence of, 16-20
        systems of, 20-24
    Coulomb's law, 1-9
    energy:
        in dielectric medium, 32-36
        stress and, 12-16
    fields in matter, 24-32
    multipoles/multipole fields, 9-12
Elevator coordinate system, 356
Energy:
    average density of, 105-106
    balance in reflection and transmission, 125
    conservation of, 14
    in dielectric medium, 32-36
    electromagnetic, 91-95
    in electrostatic field, 12-16
    of gravitational field, 347
    loss:
        in circular acceleration, 280
        in linear acceleration, 279
    in magnetic field, 69-73
    of magnetic moment in magnetic field, 63
    of particle, in time independent field, 287
    radiated, 143
    of scalar field, 182
    total radiated by relativistic particle, 169
    of vector field, positivity of, 309-310
    of vector multipoles, 227
Energy-momentum tensor, 273
Entropy, 34
Equivalence principle, 357
Extraordinary ray, 117

Factorization of wave equation, 201, 243-244
Faraday:
    effect and coefficient, 126
    law of induction, 68
Ferroelectric, 29
Field(s):
    causal relationship between, 82
    classical equations for, validity of, 24-25, 153
    with definite wave number, 103
    electric, effective, 66
    electromagnetic, see Electromagnetic field
    electrostatic, 2
        in matter, 24-32
        uniqueness of, 17, 43
    functional, 19
    gravitational, see Gravitational field
    in isotropic materials, 102-109
    Liénard-Wiechert, 165-170
    macroscopic, 24
    magnetic, units for, 50
    magnetostatic, 50-66
        in matter, 61-66
        uniform, 78
    microscopic, 24, 61
    scalar, see Scalar field
    tensor:
        covariant, 260
        electromagnetic, 264
        gravitational, 341
        propagating modes of, 344
    time-dependent, 81-134
    uniform, 78
    vector, 304-313, 399
Fields and particles, interacting, 298-304
Finite time signal, 105
Flux:
    of angular momentum, 228
    electric, 6
    energy, of electromagnetic field, 93
    energy, of gravitational field, 382
    energy, of scalar field, 183
    magnetic, 66
    time-averaged, 105
    time-integrated, 105
Force:
    between circuits, 53
    on currents, total, 72
    on dielectric, 36
    electromotive, 66-69
    generalized:
        on circuit, 71-72
        on conductor, 21-2?
    Lorentz, 52
    magnetic energy and, 69-73
    of magnetic field on circuit, 51
    radiation reaction, 178
    volume, 54
Free energy, 34
Functional, 19

Gas, dilute, 26
Gauge:
    Coulomb, 135
    harmonic, 354
    invariance, 56, 135
    Lorentz, 136
    transformation, gravitational, 353, 378-379
Gauss' law, 7-8, 36
Gauss' theorem, 402
Gravitational field:
    action for, 375
    energy of, 347
    interaction of, 356-367
    Lagrangian density for, in linear theory, 345-349
    orbit, precession of, 360
    overview, 338-340
    particles in, 349-356
    propagating modes, number of, 377-378
    rotating source for, 384
    static, 358
    weak field limit, 353
Gravitational redshift, 363, 384
Gravity:
    curvature and, 367-370
    Einstein field equations, 370-376
    gravitational field, see Gravitational field
    principle of equivalence, 357
    radiation, gravitational, 376-384
    tensor field, 341-345
Green's function:
    electrostatic, 17
    for massive scalar field, 188-191
    for radiation:
        advanced, 139
        defined, 137-138
        retarded, 138
Group velocity, 185
Gyromagnetic ratio, 59
Gyroscope, precession in gravitational field, 385

Hall effect, 48
Hamilton's principle, 281
Harmonic(s):
    function, 9, 407
    polynomials, 408
    spherical:
        completeness of, 415-418
        overview, 410-415
Helicity, 118-123
Images, 38-39
Induction:
    coefficients of, 70
    electromagnetic, 66-69
    magnetic, 62
Infinitesimal transformation, 246
Infra-red divergence, 163
Initial conditions, consistency of, 89, 106
Interaction action, 298
Internal energy, 34
Invariance:
    overview, 245-248
    reparametrization, 285
Invariant volume element, 317

Kronecker delta, 392

Lagrange equations:
    for fields, 291
    for particles, 282, 286
Lagrangian:
    for fields, 290-297
    for gravitational field, 345
    for particles, 281-284
    relativistic, for particles in field, 284-290
Lagrangian density:
    defined, 290
    for electromagnetic fields, 307
    for gravitational fields, 345-349
    higher derivatives in, 332
    with nonlocal interaction, 333
Langevin-Debye equation, 64
Laplace's equation, 9
Laplacian, in generalized coordinates, 322
Larmor precession, 73
Legendre polynomials:
    defined, 406
    generating function for, 407
Lense-Thirring effect, 386
Lenz's law, 65
Liénard-Wiechert:
    fields, 165-170
    potentials, 156-159
Light, bending of, 360
Lines of force, equation for, 44
Lorentz-Fitzgerald contraction, 242
Lorentz force, 52
Lorentz group, 252
Lorentz tensors, 257-260
Lorentz transformation, 248-256

Mach's principle, 386
Macroscopic equations:
    electromagnetic, 88
    electrostatic, 31
    magnetostatic, 62
Magnetic moment:
    density, 58
    energy of, in magnetic field, 63
Magnetic dipole radiation, 148
Magnetic pseudopotential, 59
Magnetostatics, 47
Material discontinuity, boundary conditions at, 88
Maxwell's equations, 81-83
Maxwell stress tensor, 15
Medium:
    conducting, 48
    semi-infinite, 107
Metric tensor, 315
Moment, dipole, see Dipole moment; Quadrupole
Momentum:
    angular, 121, 227, 277
    electromagnetic, 91-95
Monoaxial crystal, 116
Monochromatic signal, 105
Monopole, electric, 9
Motional electromotive force, 66
Multipole(s):
    electric, total cross section for, 238
    electrostatic, 9-12
    fields, electrostatic, 9-12
    magnetic, total cross section for, 238
    magnetostatic, 56-61
    scattering by a dielectric, 230-240
    vector, 217-226

Noether current, 292
Noether's theorem, 282

Ohm's law, 48
Optical theorem:
    for electromagnetic field scattering, 210-211
    generalized, 240
Orbit equation, in static gravitational field, 358
Orbit precession in static gravitational field, 360
Ordinary ray, 117
Orthogonal polynomials, 406, 409, 414

Parallel displacement, 318
Paramagnetism, 63
Partial waves, for scalar field, 203-208
Particle(s):
    in electromagnetic field, 285-286
    finite size charged, 124
    in given scalar field, 33)
    in gravitational field, 349-359
    interacting, 298-304
    relativistic, 169, 284-290
    scattering, 164
    slowly moving point, 144-146
    in stress-energy tensor, 275, 352
    in time-dependent field, 287
Permanent magnets, 63
Permeability, 63
Perturbation theory, for scattering:
    by damped oscillator with radiation reaction, 212-216
    by dielectric, with dielectric constant near 1, 216-217
Phase shifts, for scalar field, 203-208
Photons, equivalent, 150-156
Planck radius, 312
Plane:
    charged, 37
    wave, 109
Plasma:
    dispersion law, 132
    electron, 130
Poincaré group, 252
Poisson's equation, 8
Polarizability:
    atomic systems, 95
    diamagnetic, 66, 75
    electrostatic dipole, 27-28
    magnetic dipole, 64
    quadrupole, 28
    tensor, 29
Polarization:
    circular, 104
    elliptic, 104
    linear, 104
Polynomials:
    harmonic, 408
    Legendre:
        defined, 406-407
        generating function for, 407
    orthogonal, 409-410
Potential:
    coefficients of, 20
    complex, 240-241
    electrostatic, 4
    Liénard-Wiechert, 156-159
    magnetic, 48
    scalar, 134-137
    square well, 242
    vector, 56, 134-137
Poynting vector, 93, 160, 382
Proca equation, 306
Propagation:
    in anisotropic media, 114-118
    causal, 91, 109, 130
    scalar field, equation for, 182
    signal, 91
    of wave packet, 185
Proper time, 261
Pseudo-Euclidean system, local transformation to, 323-325
Pseudotensors, 260, 398-399
Pyroelectric, 29

Quadrupole:
    electric, 12, 40
    magnetic, 79
    radiation, 150, 380-384
Quantum theory, interpretation of classical calculation, 153, 162

Radial wave functions, 198-203
Radiation:
    Bremsstrahlung, low-frequency, 159-165
    Cerenkov, 170-176
    cyclotron, 170
    dipole:
        electric, 146-150
        magnetic, 148
    electric quadrupole, 149
    equation, Green's functions for, 137-140
    equivalent photons, 150-156
    by fixed frequency source, 140-144
    gravitational, 376
        emitted by a known source, 380
        quadrupole, 383
    Liénard-Wiechert, 165
    by prescribed sources, 134-180
    reaction, 212
    of scalar field, 190
    signature of, 141
    by slowly moving point particle, 144-146
Ratio, gyromagnetic, 59
Reflection, 109-113
Refraction, 109-113
Resistance:
    of conductor, 48
    internal, 49
Resolution, in particle scattering, 164
Rest system, 256
Retarded time, 157
Run-away solution, 214, 312
Scalar density:
    Lorentz, 261
    weight of, 317
Scalar field:
    cross section for scattering of, 193
    defined, 182
    energy of, 183
    interacting, 301
    massive, 188
    optical theorem for, 194-197
    partial waves for, 203
    phase shifts for, 203
    propagation equation for, 182
    radiation of, 190
    scattering amplitude for, 205
    stress-energy tensor, 296, 301
    theory, construction of, 332
Scattering:
    amplitude for scalar field, 205
    by conducting sphere, 241
    by a damped oscillator, 212
    by a dielectric with ε near 1, 216
    electromagnetic field, 208-210
    formulation of, 191-194
    length, 207
    low-frequency limit of, 216
    multipole, by a dielectric, 230-240
    optical theorem:
        for a scalar field, 194-198
        for light, 210-211
    partial waves, 203-208
    perturbation theory of, 211-217
    phase shifts, 203-208
    resonant, 214-215
        width for, 214-215
    of scalar field, 181-188
        Green's function for, 188-191
    vector multipoles, 217-227
    of wave packet, 183
Schrödinger equation, 96
Shielding, electrostatic, 37
Skin depth, 110
Snell's law, 109
Spacelike interval, 254
Sphere, charged, 36
Spherical Bessel functions, 199
Spherical harmonics, 39-40, 410
    completeness of, 415
Stress:
    in electrostatic field, 12-16
    tensor, see Stress-energy tensor; Stress tensor
Stress-energy tensor:
    canonical, 291
    construction of covariantly conserved, symmetric, 293-295, 300-304, 326-331
    electromagnetic, 273
    for interacting scalar field, 296
    for matter and gravity, 371-374
    particle, 275, 352
    symmetric, construction of, 293
Stress tensor. See also Stress-energy tensor
    magnetic, 72
    Maxwell electrostatic, 15
    particle, 352
Summation convention, 391
Superconductor, levitation of, 79
Superluminal velocity, 254
Sylvester's theorem, 11

Tensor, see specific types of tensors
    angular momentum, 277
    defined, 396-398
    density, 263
    stress-energy-momentum, 273
Thomson cross section, 216
Three-body electrostatic forces, 2
Timelike interval, 254
Time reversal, 99
Transformation:
    infinitesimal, 246
    Lorentz, 248-256
        of electric and magnetic fields, 267
        local, to a pseudo-Euclidean coordinate system, 323

Uncertainty relation, 185
Units:
    electrostatic, 2
    magnetic, 50

Variational derivative, 282
Variational principle, electrostatic field, 19, 43
Vector:
    algebra and analysis, three-dimensional, 401-402
    contravariant, transformation of, 313-314
    covariant, 254, 257
    density, Lorentz, 262
    energy-momentum, 255-256
    fields, 399-400
    multipoles, 217-227
        energy and angular momentum of, 227
    potential, 56, 134-137
    Poynting, 93, 160, 382
    transformations, 391
    unit, 391
Velocity:
    group, 185
    superluminal, 254
Virtual present radius, 167
Voltaic cell, 49
Wave packet:
    propagation of, 185
    scattering of, 183
Wave zone, 141
Weizsacker, F., 156
Width, of energy levels, 99
Williams, J., 156
A unique textbook on
electromagnetism and gravitation 


This volume combines a novel approach with an accessible, down- 
to-earth treatment of electromagnetism and gravitation. It leads the 
student through classical electromagnetic theory, and introduces the 
gravitational field as a conventional second-rank tensor field. 


Clear, concise, and self-contained, this theoretical exposition 
focuses on basic principles rather than applications and avoids 
abstractions through a careful selection of topics. Classical Field 
Theory: Electromagnetism and Gravitation features 


• Everything a student should know to grasp the fundamentals of
classical field theory


• A chapter on scattering that discusses material not readily
available in other textbooks


• Two appendices, one on vectors and tensors, the other on
spherical harmonics, to review material recurring throughout the
text


• End-of-chapter exercises, some of which serve as mini-research
problems


Based on courses taught by the author at the Massachusetts 
Institute of Technology, Classical Field Theory is an excellent text for 
a two-semester first-year graduate course on electromagnetism and 
gravitation. 